G PROTEIN-COUPLED RECEPTORS EXPRESSED IN BRAIN

RELATED APPLICATIONS 

This patent application is a continuation-in-part of the following U.S.
patent applications: Serial No. 09/481,794 filed January 12, 2000; Serial No.
09/454,399 filed December 3, 1999; Serial Nos. 09/429,517, 09/429,555, 09/429,676,
and 09/429,695 filed October 28, 1999; and Serial Nos. 09/428,114, 09/428,020,
09/427,859 and 09/427,653 filed October 27, 1999. All of these applications are
incorporated herein by reference.

FIELD OF THE INVENTION 

The present invention relates generally to the fields of genetics and
cellular and molecular biology. More particularly, the invention relates to novel G
protein-coupled seven transmembrane receptor polynucleotide and polypeptide
sequences that are expressed in the brain.

DESCRIPTION OF RELATED ART 

Humans and other life forms are composed of living cells. Among the
mechanisms through which the cells of an organism communicate with each other and
obtain information and stimuli from their environment are cell membrane
receptor molecules expressed on the cell surface. Many such receptors have been
identified, characterized, and sometimes classified into major receptor superfamilies 
based on structural motifs and signal transduction features. Such families include (but 
are not limited to) ligand-gated ion channel receptors, voltage-dependent ion channel 
receptors, receptor tyrosine kinases, receptor protein tyrosine phosphatases, and G 
protein-coupled receptors. The receptors are a first essential link for translating an 
extracellular signal into a cellular physiological response. 

The G protein-coupled receptors (GPCR) form a vast superfamily of 
cell surface receptors which are characterized by an amino-terminal extracellular 
domain, a carboxyl-terminal intracellular domain, and a serpentine structure that 
passes through the cell membrane seven times. Hence, such receptors are sometimes 
also referred to as seven transmembrane (7TM) receptors. These seven
transmembrane domains define three extracellular loops and three intracellular loops,
in addition to the amino- and carboxyl-terminal domains. The extracellular portions
of the receptor have a role in recognizing and binding one or more extracellular
binding partners (ligands), whereas the intracellular portions have a role in
recognizing and communicating with downstream effector molecules.
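
By way of illustration only, the alternating arrangement of loops that follows from seven membrane passes can be sketched in a few lines of Python. The function name `loop_topology` and its parameters are hypothetical and form no part of the disclosure; the sketch assumes only that each transmembrane pass carries the chain to the opposite side of the membrane, starting from an extracellular amino terminus.

```python
from typing import List, Tuple


def loop_topology(n_tm: int = 7,
                  n_terminus: str = "extracellular") -> Tuple[List[Tuple[str, str]], str]:
    """Return the side of each inter-helix loop and of the C-terminus.

    Assumption (illustrative): every transmembrane (TM) pass flips the
    chain from one side of the membrane to the other.
    """
    sides = ["extracellular", "intracellular"]
    side = sides.index(n_terminus)
    loops = []
    for i in range(1, n_tm):
        side = 1 - side  # crossing TM<i> moves the chain to the other side
        loops.append((f"loop TM{i}-TM{i + 1}", sides[side]))
    c_terminus = sides[1 - side]  # crossing the final TM segment flips once more
    return loops, c_terminus


loops, c_term = loop_topology()
for name, side in loops:
    print(name, side)
print("C-terminus:", c_term)
```

Under these assumptions, seven passes yield six loops, three on each side of the membrane, with the carboxyl terminus on the intracellular side, consistent with the description above.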

The G protein-coupled receptors bind a variety of ligands including
calcium ions, hormones, chemokines, neuropeptides, neurotransmitters, nucleotides,
lipids, odorants, and even photons, and are important in the normal (and sometimes
the aberrant) function of many cell types. [See generally A.D. Strosberg, Eur. J.
Biochem., 196: 1-10 (1991) and S. K. Bohm et al., Biochem. J., 322: 1-18 (1997).]
When a specific ligand binds to its corresponding receptor, the ligand stimulates the
receptor to activate a specific heterotrimeric guanine-nucleotide-binding regulatory
protein (G-protein) that is coupled to the intracellular portion of the receptor. The G
protein in turn transmits a signal to an effector molecule within the cell, by either
stimulating or inhibiting the activity of that effector molecule. These effector
molecules include adenylate cyclase, phospholipases, and ion channels. Adenylate
cyclase and phospholipases are enzymes that are involved in the production of the
second messenger molecules cAMP, inositol triphosphate and diacylglycerol. It is
through this sequence of events that an extracellular ligand stimulus exerts intracellular
changes through a G protein-coupled receptor. Each such receptor has its own
characteristic primary structure, expression pattern, ligand-binding profile, and
intracellular effector system.

Because of the vital role of G protein-coupled receptors in the
communication between cells and their environment, such receptors are attractive
targets for therapeutic intervention, and many drugs have been registered which are
directed towards activating or antagonizing such receptors. For receptors having a
known ligand, the identification of agonists or antagonists may be sought specifically
for enhancing or inhibiting the action of the ligand. Some G protein-coupled
receptors have roles in disease pathogenesis (e.g., certain chemokine receptors that act
as HIV co-receptors and may have a role in AIDS pathogenesis), and are attractive
targets for therapeutic intervention even in the absence of knowledge of the natural
ligand of the receptor. Other receptors are attractive targets for therapeutic
intervention by virtue of their expression pattern in tissues or cell types that are
attractive targets for therapeutic intervention. Examples of this latter category of
receptors include receptors expressed in immune cells, for targeting to enhance
immune responses to fight pathogens or cancer or inhibit autoimmune responses; and
receptors expressed in the brain or other neurons, for targeting to treat schizophrenia,
depression, bipolar disease, or other neurological disorders. This latter category of
receptor is also useful as a marker for identifying and/or purifying (e.g., via
fluorescence activated cell sorting) cellular subtypes that express the receptor.
Unfortunately, only a limited number of G protein receptors from the central nervous
system (CNS) are known. A need exists for identifying the existence and structure of
such G protein-coupled receptors.



SUMMARY OF THE INVENTION 

The present invention addresses one or more of the needs identified
above in that it provides purified polynucleotides encoding heretofore unknown G
protein-coupled receptors (GPCR); constructs and recombinant host cells
incorporating the polynucleotides; GPCR polypeptides encoded by the
polynucleotides; antibodies to the polypeptides; and methods of making and using all
of the foregoing. As set forth in detail herein, the GPCR polypeptides described
herein are expressed in the brain, providing a therapeutic indication for GPCR
polypeptides and binding partners to treat diseases associated with this tissue.

The invention provides purified and isolated GPCR seven
transmembrane receptor polypeptides comprising any one of the amino acid
sequences set forth in SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18 or 20, or a fragment
thereof comprising an epitope specific to the seven transmembrane receptor. By
"epitope specific to" is meant a portion of the receptor that is recognizable by an
antibody that is specific for that seven transmembrane receptor, as defined in detail
below.

One preferred embodiment comprises a purified and isolated
polypeptide designated CON193, comprising the complete amino acid sequence set
forth in SEQ ID NO: 2. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON193 (SEQ ID NO: 1), as set forth below:
ntggttgttg gaccattaaa atgcattatg gaatttttaa aagttggggg agagggagac 60 
agtaaaaata acctatatct tctcttgttt tttttttttt aactctagga aagcccagac 120
aaattttgag ctatttcata acctaccaga cttatc atg cta aca ctg aat aaa 174

Met Leu Thr Leu Asn Lys 
1 5 

aca gac cta ata cca gct tca ttt att ctg aat gga gtc cca gga ctg 222
Thr Asp Leu Ile Pro Ala Ser Phe Ile Leu Asn Gly Val Pro Gly Leu
10 15 20

gaa gac aca caa etc tgg att tee ttc cca ttc tgc tct atg tat gtt 270 
Glu Asp Thr Gin Leu Trp He Ser Phe Pro Phe Cys Ser Met Tyr Val 

25 30 35 

gtg gct atg gta ggg aat tgt gga ctc ctc tac ctc att cac tat gag 318
Val Ala Met Val Gly Asn Cys Gly Leu Leu Tyr Leu Ile His Tyr Glu

40 45 50

gat gee ctg cac aaa cec atg tac tac ttc ttg gcc atg ctt tec ttt 366 
Asp Ala Leu His Lys Pro Met Tyr Tyr Phe Leu Ala Met Leu Ser Phe 
55 60 65 70 

20 act gae ett gtt atg tgc tct agt aca ate cet aaa gcc etc tge ate 414 

Thr Asp Leu Val Met Cys Ser Ser Thr He Pro Lys Ala Leu Cys He 

75 80 85 

ttc tgg ttt cat etc aag gac att gga ttt gat gaa tgc ctt gtc cag 462 
Phe Trp Phe His Leu Lys Asp He Gly Phe Asp Glu Cys Leu Val Gin 
25 90 95 100 

atg ttc ttc ate cac acc ttc aca ggg atg gag tct ggg gtg ctt atg 510 
Met Phe Phe He His Thr Phe Thr Gly Met Glu Ser Gly Val Leu Met 

105 110 115 

ctt atg gcc ctg gat cgc tat gtg gcc atc tgc tac ccc tta cgc tat 558
Leu Met Ala Leu Asp Arg Tyr Val Ala Ile Cys Tyr Pro Leu Arg Tyr

120 125 130 

tea act ate etc acc aat cct gta att gca aag gtt ggg act gcc acc 606 
Ser Thr He Leu Thr Asn Pro Val He Ala Lys Val Gly Thr Ala Thr 
135 140 145 150 

35 ttc ctg aga ggg gta tta etc att att cec ttt act ttc etc ace aag 654 

Phe Leu Arg Gly Val Leu Leu He He Pro Phe Thr Phe Leu Thr Lys 

155 160 165 

egc etg cec tec tge aga ggc aat ata ett cec cat ace tac tgt gac 702 
Arg Leu Pro Ser Cys Arg Gly Asn He Leu Pro His Thr Tyr Cys Asp 
170 175 180
cac atg tct gta gcc aaa ttg tcc tgt ggt aat gtc aag gtc aat gcc 750
His Met Ser Val Ala Lys Leu Ser Cys Gly Asn Val Lys Val Asn Ala

185 190 195

atc tat ggt ctg atg gtt gcc ctc ctg att ggg ggc ttt gac ata ctg 798
Ile Tyr Gly Leu Met Val Ala Leu Leu Ile Gly Gly Phe Asp Ile Leu

200 205 210

tgt atc acc atc tcc tat acc atg att ctc cgg gca gtg gtc agc ctc 846
Cys Ile Thr Ile Ser Tyr Thr Met Ile Leu Arg Ala Val Val Ser Leu
215 220 225 230 

10 tec tea gca gat get egg cag aag gcc ttt aat acc tgc act gcc cac 894 

Ser Ser Ala Asp Ala Arg Gin Lys Ala Phe Asn Thr Cys Thr Ala His 

235 240 245 

att tgt gcc att gtt ttc tcc tat act cca gct ttc ttc tcc ttc ttt 942
Ile Cys Ala Ile Val Phe Ser Tyr Thr Pro Ala Phe Phe Ser Phe Phe
15 250 255 260 

tec cac egc ttt ggg gaa cac ata ate ccc cct tct tgc cac ate att 990 
Ser His Arg Phe Gly Glu His He He Pro Pro Ser Cys His He He 

265 270 275 

gta gcc aat att tat ctg etc eta eca ccc act atg aac cct att gtc 1038 
20 Val Ala Asn He Tyr Leu Leu Leu Pro Pro Thr Met Asn Pro He Val 

280 285 290 

tat ggg gtg aaa acc aaa cag ata ega gac tgt gtc ata agg ate ett 1086 
Tyr Gly Val Lys Thr Lys Gin He Arg Asp Cys Val He Arg He Leu 
295 300 305 310 

25 tea ggt tct aag gat ace aaa tec tac age atg tga atgaacaett 1132 

Ser Gly Ser Lys Asp Thr Lys Ser Tyr Ser Met 

315 320 
gccaggagtg agaagagaag gaaagaatta cttctatttg cctcttatgc aggagttcat 1192
aaaatctttc tggaagtact gtattgatca caaaatggag tttgntgact ggtgcattc 1252
caataagtac cttgggaatc tnacatcact ggaaggccca ccacatttct ataaat 1308

Another preferred embodiment comprises a purified and isolated
polypeptide designated CON166, comprising the complete amino acid sequence set
forth in SEQ ID NO: 4. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON166 (SEQ ID NO: 3), as set forth below:

atg gat gaa aca gga aat ctg aca gta tct tct gcc aca tgc cat gac 48

Met Asp Glu Thr Gly Asn Leu Thr Val Ser Ser Ala Thr Cys His Asp

1 5 10 15

act att gat gac ttc egc aat caa gtg tat tee acc ttg tac tct atg 96 
Thr He Asp Asp Phe Arg Asn Gin Val Tyr Ser Thr Leu Tyr Ser Met 

20 25 30
atc tct gtt gta ggc ttc ttt ggc aat ggc ttt gtg ccc tat gtc ctc 144
Ile Ser Val Val Gly Phe Phe Gly Asn Gly Phe Val Leu Tyr Val Leu

35 40 45

ata aaa acc tat cac aag aag tea gcc ttc caa gta tac atg att aat 192 
5 He Lys Thr Tyr His Lys Lys Ser Ala Phe Gin Val Tyr Met He Asn 

50 55 60 

tta gca gta gca gat eta ctt tgt gtg tgc aca ctg cct etc egt gtg 240 
Leu Ala Val Ala Asp Leu Leu Cys Val Cys Thr Leu Pro Leu Arg Val 
65 70 75 80 

10 gtc tat tat gtt cac aaa ggc att tgg etc ttt ggt gac ttc ttg tge 288 

Val Tyr Tyr Val His Lys Gly He Trp Leu Phe Gly Asp Phe Leu Cys 

85 90 95 

egc etc age acc tat get ttg tat gtc aae etc tat tgt age ate ttc 336 
Arg Leu Ser Thr Tyr Ala Leu Tyr Val Asn Leu Tyr Cys Ser He Phe 
15 100 105 110 

ttt atg aca gcc atg age ttt ttc egg tgc att gca att gtt ttt cca 384 
Phe Met Thr Ala Met Ser Phe Phe Arg Cys He Ala He Val Phe Pro 

115 120 125 

gtc cag aae att aat ttg gtt aca eag aaa aaa gcc agg ttt gtg tgt 432 
20 Val Gin Asn He Asn Leu Val Thr Gin Lys Lys Ala Arg Phe Val Cys 

130 135 140 

gta ggt att tgg att ttt gtg att ttg acc agt tct cca ttt eta atg 480 
Val Gly He Trp He Phe Val He Leu Thr Ser Ser Pro Phe Leu Met 
145 150 155 160 

25 gcc aaa cca caa aaa gat gag aaa aat aat acc aag tgc ttt gag ccc 528 

Ala Lys Pro Gin Lys Asp Glu Lys Asn Asn Thr Lys Cys Phe Glu Pro 

165 170 175 

cca caa gac aat caa act aaa aat eat gtt ttg gtc ttg cat tat gtg 576 
Pro Gin Asp Asn Gin Thr Lys Asn His Val Leu Val Leu His Tyr Val 
180 185 190

tca ttg ttt gtt ggc ttt atc atc cct ttt gtt att ata att gtc tgt 624
Ser Leu Phe Val Gly Phe Ile Ile Pro Phe Val Ile Ile Ile Val Cys
195 200 205 

tac aca atg ate att ttg acc tta eta aaa aaa tea atg aaa aaa aat 672 
35 Tyr Thr Met He He Leu Thr Leu Leu Lys Lys Ser Met Lys Lys Asn 

210 215 220 

ctg tca agt cat aaa aag gct ata gga atg atc atg gtc gtg acc gct 720
Leu Ser Ser His Lys Lys Ala Ile Gly Met Ile Met Val Val Thr Ala
225 230 235 240 

gcc ttt tta gtc agt ttc atg cca tat cat att caa egt acc att cac 768 
Ala Phe Leu Val Ser Phe Met Pro Tyr His He Gin Arg Thr He His 
245 250 255

ctt cat ttt tta cac aat gaa act aaa ccc tgt gat tct gtc ctt aga 816
Leu His Phe Leu His Asn Glu Thr Lys Pro Cys Asp Ser Val Leu Arg 

260 265 270 

atg cag aag tec gtg gtc ata acc ttg tct ctg get gca tec aat tgt 864 
5 Met Gin Lys Ser Val Val He Thr Leu Ser Leu Ala Ala Ser Asn Cys 

275 280 285 

tgc ttt gac cct etc eta tat ttc ttt tct ggg ggt aae ttt agg aaa 912 
Cys Phe Asp Pro Leu Leu Tyr Phe Phe Ser Gly Gly Asn Phe Arg Lys 
290 295 300 

10 agg ctg tct aca ttt aga aag cat tct ttg tec age gtg act tat gta 960 

Arg Leu Ser Thr Phe Arg Lys His Ser Leu Ser Ser Val Thr Tyr Val 
305 310 315 320 

ccc aga aag aag gee tct ttg cca gaa aaa gga gaa gaa ata tgt aaa 1008 
Pro Arg Lys Lys Ala Ser Leu Pro Glu Lys Gly Glu Glu He Cys Lys 
15 325 330 335 

gta tag 1014 
Val 
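
As a non-limiting illustration of how an amino acid sequence is deduced from a polynucleotide sequence, the following Python sketch translates codons using the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical and chosen for illustration only; the input strings are the first sixteen codons and the final two codons of the CON166 listing above, and the sketch assumes that translation begins at the first nucleotide supplied and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = ("FFLLSSSSYY**CC*W"   # t--
         "LLLLPPPPHHQQRRRR"   # c--
         "IIIMTTTTNNKKSSRR"   # a--
         "VVVVAAAADDEEGGGG")  # g--
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into one-letter residues.

    Translation starts at the first base and stops at the first stop codon;
    codons containing unrecognized characters are reported as 'X'.
    """
    seq = "".join(dna.lower().split())
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(aa)
    return "".join(residues)


start = "atg gat gaa aca gga aat ctg aca gta tct tct gcc aca tgc cat gac"
print(translate(start))      # one-letter form of Met Asp Glu Thr Gly Asn ...
print(translate("gta tag"))  # final residue followed by the stop codon
```

The one-letter output corresponds to the three-letter residues printed beneath each codon line of the listing.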

Still another preferred embodiment comprises a purified and isolated
polypeptide designated CON103, comprising the complete amino acid sequence set
forth in SEQ ID NO: 6. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON103 (SEQ ID NO: 5), as set forth below:
ggggcctact tcaccgtgta cccggacttg ggaccatcac agacttcaga accatcagga 60
acctgggagc aactgaaagc tgaactacag tgggctttca gacacacagc aggctgcgga 120
gcacaaatag gactggttcc ctccaggcca ccagcagggc ggtggaggtc ttcactgact 180
ccctgcctac ctctcaggac aatgtccttt tggctccaca gtccctgaag ccagagctgg 240
tgggggcagg gaggcagcca ccagcctcta tatgtagtgg aggagggggt gtccagggag 300
ggctgcatga tcctgagagc ccccacctca cccggctgga ctatcctccc acttcagggt 360
ttctctgggc ttccatcttg cccctgctga gccctgcttc ctcctctacc agcagcacaa 420
cccccaggct gggctcagag acctcatgtg gtgggatcac tcagtacccc gaggcggagg 480
gaaggaggga gggctgcagg gttccccttg gcctgcaaac aggaacacag ggtgtttctc 540
^gtggctgcg agaatgctga tgaaaacccc aggatgttgt gtcaccgtgg tggccagctg 600
atagtgccaa tcatcccact ttgccctgag cactcctgca ggggtagaag actccagaac 660
cttctctcag gcccatggcc caagcagccc atg gaa ctt cat aac ctg agc tct 714

Met Glu Leu His Asn Leu Ser Ser 
1 5 

cca tct ccc tct etc tec tec tct gtt etc eet ccc tec ttc tct ccc 762 
Pro Ser Pro Ser Leu Ser Ser Ser Val Leu Pro Pro Ser Phe Ser Pro 

10 15 20 

tea ccc tec tet get ccc tct gee ttt ace act gtg ggg ggg tec tct 810 
Ser Pro Ser Ser Ala Pro Ser Ala Phe Thr Thr Val Gly Gly Ser Ser 
25 30 35 40

gga ggg ccc cgc cac ccc acc tct tcc tcg ctg gtg tct gcc ctc ctg 858
Gly Gly Pro Cys His Pro Thr Ser Ser Ser Leu Val Ser Ala Phe Leu 

45 50 55 

gca cca ate ctg gcc ctg gag ttt gtc ctg ggc ctg gtg ggg aac agt 906 
5 Ala Pro lie Leu Ala Leu Glu Phe Val Leu Gly Leu Val Gly Asn Ser 

60 65 70 

ttg gcc etc ttc ate ttc tgc ate cac acg egg ccc tgg acc tee aac 954 
Leu Ala Leu Phe lie Phe Cys lie His Thr Arg Pro Trp Thr Ser Asn 
75 80 85 

10 acg gtg ttc ctg gtc age ctg gtg gee get gac ttc etc etg ate age 1002 

Thr Val Phe Leu Val Ser Leu Val Ala Ala Asp Phe Leu Leu lie Ser 

90 95 100 

aac ctg ccc ctc cgc gtg gac tac tac ctc ctc cat gag acc tgg cgc 1050
Asn Leu Pro Leu Arg Val Asp Tyr Tyr Leu Leu His Glu Thr Trp Arg
15 105 110 115 120 

ttt ggg get get gcc tge aaa gtc aac etc ttc atg ctg tee ace aac 1098 
Phe Gly Ala Ala Ala Cys Lys Val Asn Leu Phe Met Leu Ser Thr Asn 

125 130 135 

cgc acg gee age gtt gtc ttc etc aca gcc ate gca etc aac cgc tac 1146 
20 Arg Thr Ala Ser Val Val Phe Leu Thr Ala lie Ala Leu Asn Arg Tyr 

140 145 150 

ctg aag gtg gtg eag ccc cac cac gtg etg age egt get tee gtg ggg 1194 
Leu Lys Val Val Gin Pro His His Val Leu Ser Arg Ala Ser Val Gly 
155 160 165 

25 gea get gcc egg gtg gcc ggg gga etc tgg gtg ggc ate ctg etc etc 1242 

Ala Ala Ala Arg Val Ala Gly Gly Leu Trp Val Gly lie Leu Leu Leu 

170 175 180 

aac ggg cac etg etc ctg age acc ttc tec ggc ccc tee tgc etc age 1290 
Asn Gly His Leu Leu Leu Ser Thr Phe Ser Gly Pro Ser Cys Leu Ser 
30 185 190 195 200 

tae agg gtg ggc acg aag eec teg gcc teg etc cgc tgg cac eag gca 1338 
Tyr Arg Val Gly Thr Lys Pro Ser Ala Ser Leu Arg Trp His Gin Ala 

205 210 215 

ctg tac ctg ctg gag ttc ttc ctg cca ctg gcg ctc atc ctc ttt gct 1386
Leu Tyr Leu Leu Glu Phe Phe Leu Pro Leu Ala Leu Ile Leu Phe Ala

220 225 230 

att gtg age att ggg etc acc ate egg aac cgt ggt etg ggc ggg eag 1434 
lie Val Ser lie Gly Leu Thr lie Arg Asn Arg Gly Leu Gly Gly Gin 

235 240 245 

gea ggc ccg cag agg gcc atg cgt gtg ctg gcc atg gtg gtg gee gtc 1482 
Ala Gly Pro Gin Arg Ala Met Arg Val Leu Ala Met Val Val Ala Val 
250 255 260

tac acc atc tgc ttc ttg ccc agc atc atc ttc ggc atg gct tcc atg 1530
Tyr Thr Ile Cys Phe Leu Pro Ser Ile Ile Phe Gly Met Ala Ser Met
265 270 275 280

gtg get tte tgg ctg tec gee tgc cga tec ctg gac etc tgc aea cag 1578 
5 Val Ala Phe Trp Leu Ser Ala Cys Arg Ser Leu Asp Leu Cys Thr Gin 

285 290 295 

etc tte cat ggc tee ctg gcc ttc acc tac etc aac agt gtc ctg gae 1626 
Leu Phe His Gly Ser Leu Ala Phe Thr Tyr Leu Asn Ser Val Leu Asp 
300 305 310 

10 ccc gtg etc tac tgc ttc tct age ccc aac tte etc cac cag age egg 1674 

Pro Val Leu Tyr Cys Phe Ser Ser Pro Asn Phe Leu His Gin Ser Arg 

315 320 325 

gcc ttg ctg ggc etc acg egg ggc egg cag ggc cca gtg age gac gag 1722 
Ala Leu Leu Gly Leu Thr Arg Gly Arg Gin Gly Pro Val Ser Asp Glu 

15 330 335 340 

age tec tac caa ccc tec agg cag tgg cgc tac egg gag gcc tct agg 1770 

Ser Ser Tyr Gin Pro Ser Arg Gin Trp Arg Tyr Arg Glu Ala Ser Arg 

345 350 355 360 

aag gcg gag gcc ata ggg aag ctg aaa gtg cag ggc gag gtc tct ctg 1818 

20 Lys Ala Glu Ala lie Gly Lys Leu Lys Val Gin Gly Glu Val Ser Leu 

365 370 375 

gaa aag gaa ggc tee tec cag ggc tga gggccagctg cagggctgca 1865 
Glu Lys Glu Gly Ser Ser Gin Gly 

380 385 

gcgctgtggg ggtaagggct gccgcgctct ggcctggagg gacaaggcca gcacacggtg 1925
cctcaaccaa ctggacaagg gatggcggca gaccaggggc caggccaaag cactggcagg 1985
actcatgtgg gtggcaggga gagaaaccca cctaggcctc tcagtgtgtc caggatggca 2045
ttcccagaat gcaggggaga gcaggatgcc gggtggagga gacaggcaag gtgccgttgg 2105
cacaccagct cagacagggg cctgcgcagc tgcaggggac agacgccaat cactgtcaca 2165
gcagagtcac cttagaaatt ggacagctgc atgttctgtg ctctccagtt tgtcccttcc 2225
aatattaata aacttccctt ttaaatatat ttatttgcag accaatatct gtctttaatt 2285
ctaacctggg actgtcagta ggcgtcaaag tgagcgcccc agtgaaggaa ccttggagag 2345
^gtgggagca ttcccagcct tccaggggga ctcgtcttcc agactttgga gcccgcatgt 2405
ctgaagcaga ctctttcttg gtag 2429

Another preferred embodiment comprises a purified and isolated
polypeptide designated CON203, comprising the complete amino acid sequence set
forth in SEQ ID NO: 8. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON203 (SEQ ID NO: 7), as set forth below:
ttgaatttag gtgacactat agaagagcta tgacgtcgca tgcacgcgta cgtaagctcg 60
gaattcggct cgagctgaac taatgactgc cgccataaga agacagagag aactgagtat 120
cctcccaaag gtgacactgg aagca atg aac acc aca gtg atg caa ggc ttc 172

Met Asn Thr Thr Val Met Gin Gly Phe 
1 5 

aac aga tct gag egg tgc ccc aga gac act egg ata gta cag ctg gta 220 
Asn Arg Ser Glu Arg Cys Pro Arg Asp Thr Arg lie Val Gin Leu Val 
10 15 20 25 

ttc oca gcc etc tac aca gtg gtt ttc ttg acc ggc ate ctg ctg aat 268 
Phe Pro Ala Leu Tyr Thr Val Val Phe Leu Thr Gly lie Leu Leu Asn 

30 35 40 

act ttg get ctg tgg gtg ttt gtt cac ate ccc age tec tee acc ttc 316 
Thr Leu Ala Leu Trp Val Phe Val His lie Pro Ser Ser Ser Thr Phe 

45 50 55 

ate ate tac etc aaa aac act ttg gtg gee gac ttg ata atg aca etc 364 
lie lie Tyr Leu Lys Asn Thr Leu Val Ala Asp Leu lie Met Thr Leu 

60 65 70 

atg ctt cct ttc aaa ate etc tct gac tea cac ctg gca ccc tgg cag 412 
Met Leu Pro Phe Lys lie Leu Ser Asp Ser His Leu Ala Pro Trp Gin 

75 80 85 

etc aga get ttt gtg tgt cgt ttt tct teg gtg ata ttt tat gag ace 460 
Leu Arg Ala Phe Val Cys Arg Phe Ser Ser Val lie Phe Tyr Glu Thr 
90 95 100 105 

atg tat gtg ggc ate gtg ctg tta ggg etc ata gcc ttt gae aga ttc 508 
Met Tyr Val Gly lie Val Leu Leu Gly Leu lie Ala Phe Asp Arg Phe 

110 115 120 

etc aag ate ate aga cct ttg aga aat att ttt eta aaa aaa cct gtt 556 
Leu Lys lie lie Arg Pro Leu Arg Asn lie Phe Leu Lys Lys Pro Val 

125 130 135 

ttt gca aaa aeg gtc tea ate ttc ate tgg gtc ttt ttg gtc ttc ate 604 
Phe Ala Lys Thr Val Ser He Phe He Trp Val Phe Leu Val Phe He 

140 145 150 

tec ctg cea aat atg ate ttg age aac aag gaa gca aca cca teg tct 652 
Ser Leu Pro Asn Met He Leu Ser Asn Lys Glu Ala Thr Pro Ser Ser 

155 160 165 

gtg aaa aag tgt get tec tta aag ggg cct ctg ggg ctg aaa tgg cat 700 
Val Lys Lys Cys Ala Ser Leu Lys Gly Pro Leu Gly Leu Lys Trp His 
170 175 180 185 

caa atg gta aat aac ata tgc cag ttt att ttc tgg act ggt ttt ate 748 
Gin Met Val Asn Asn He Cys Gin Phe He Phe Trp Thr Gly Phe He 

190 195 200 

eta atg ctt gtg ttt tat gtg gtt att gca aaa aaa gta tat gat tct 796 
Leu Met Leu Val Phe Tyr Val Val He Ala Lys Lys Val Tyr Asp Ser 
205 210 215
tat aga aag tcc aaa agt aag gac aga aaa aac aac aaa aag ctg gaa 844
Tyr Arg Lys Ser Lys Ser Lys Asp Arg Lys Asn Asn Lys Lys Leu Glu

220 225 230 

ggc aaa gta ttt gtt gtc gtg get gtc ttc ttt gtg tgt ctt get cca 892 
5 Gly Lys Val Phe Val Val Val Ala Val Phe Phe Val Cys Phe Ala Pro 

235 240 245 

ttt cat ttt gee aga gtt cca tat act cac agt caa acc aac aat aag 940 
Phe His Phe Ala Arg Val Pro Tyr Thr His Ser Gin Thr Asn Asn Lys 
250 255 260 265 

10 act gac tgt aga ctg caa aat caa ctg ttt att get aaa gaa aea act 988 

Thr Asp Cys Arg Leu Gin Asn Gin Leu Phe lie Ala Lys Glu Thr Thr 

270 275 280 

etc ttt ttg gca gca act aac att tgt atg gat ccc tta ata tac ata 1036 
Leu Phe Leu Ala Ala Thr Asn lie Cys Met Asp Pro Leu lie Tyr lie 
15 285 290 295 

ttc tta tgt aaa aaa ttc aca gaa aag cta cca tgt atg caa ggg aga 1084
Phe Leu Cys Lys Lys Phe Thr Glu Lys Leu Pro Cys Met Gln Gly Arg

300 305 310 

aag acc aca gca tea age caa gaa aat cat age agt cag aea gac aac 1132 
20 Lys Thr Thr Ala Ser Ser Gin Glu Asn His Ser Ser Gin Thr Asp Asn 

315 320 325 

ata acc tta ggc tga caactgtaca tagggttaac ttctatttat tgatgagact 1187 
lie Thr Leu Gly 
330 

tccgtagata atgtggaaat caaatttaac caagaaaaaa agattggaac aaatgctctc 1247
ttacatttta tttatcctgg tgtccaggaa aagattatat taaatttaaa tccacataga 1307
tctattcata agctgaatga accattacct aagagaatgc aacaggatac caatggccac 1367
tagaggcata ttccttcttc tttttttttt gttaaatttc aagagcattc actttacatt 1427
tggaaagact aaggggaacg gttatcctac aaacctccct tcaacacctt ttacatt 1484

Another preferred embodiment comprises a purified and isolated
polypeptide designated CON198, comprising the complete amino acid sequence set
forth in SEQ ID NO: 10. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON198 (SEQ ID NO: 9), as set forth below:

atg atg gtg gat ccc aat ggc aat gaa tcc agt gct aca tac ttc atc 48
Met Met Val Asp Pro Asn Gly Asn Glu Ser Ser Ala Thr Tyr Phe Ile

1 5 10 15

cta ata ggc ctc act ggt tta gaa gag gct cag ttc tgg ttg gcc ttc 96

Leu Ile Gly Leu Pro Gly Leu Glu Glu Ala Gln Phe Trp Leu Ala Phe
20 25 30

cca ttg tgc tcc ctc tac ctt att gct gtg cta ggt aac ttg aca atc 144

Pro Leu Cys Ser Leu Tyr Leu Ile Ala Val Leu Gly Asn Leu Thr Ile
35 40 45

atc tac att gtg cgg act gag cac agc ctg cat gag ccc atg tat ata 192

lie Tyr lie Val Arg Thr Glu His Ser Leu His Glu Pro Met Tyr He 

50 55 60 

ttt ett tgc atg ctt tea ggc att gac ate etc ate tec acc tea tec 240 
Phe Leu Cys Met Leu Ser Gly He Asp He Leu He Ser Thr Ser Ser 
65 70 75 80 

atg cee aaa atg ctg gee ate ttc tgg ttc aat tec act ace ate cag 288 
Met Pro Lys Met Leu Ala He Phe Trp Phe Asn Ser Thr Thr He Gin 
85 90 95 

ttt gat get tgt etg eta cag atg ttt gee ate cac tec tta tct ggc 336 
Phe Asp Ala Cys Leu Leu Gin Met Phe Ala He His Ser Leu Ser Gly 
100- 105 110 

atg gaa tec aca gtg ctg ctg gcc atg get ttt gac cgc tat gtg gcc 384 
Met Glu Ser Thr Val Leu Leu Ala Met Ala Phe Asp Arg Tyr Val Ala 
115 120 125 

ate tgt cac cca etg cgc cat gee aca gta ctt acg ttg cct cgt gtc 432 
He Cys His Pro Leu Arg His Ala Thr Val Leu Thr Leu Pro Arg Val 
130 135 140 

acc aaa att ggt gtg get get gtg gtg egg ggg get gea ctg atg gca 480 
Thr Lys He Gly Val Ala Ala Val Val Arg Gly Ala Ala Leu Met Ala 
145 150 155 160 

cec ett cct gte tte ate aag cag ctg cec ttc tgc cgc tec aat ate 528 
Pro Leu Pro Val Phe He Lys Gin Leu Pro Phe Cys Arg Ser Asn He 
165 170 175 

ctt tec cat tec tac tgc eta cac caa gat gtc atg aag ctg gcc tgt 576 
Leu Ser His Ser Tyr Cys Leu His Gin Asp Val Met Lys Leu Ala Cys 
180 185 190 

gat gat ate egg gtc aat gtc gtc tat ggc ctt ate gte ate ate tec 624 
Asp Asp He Arg Val Asn Val Val Tyr Gly Leu He Val He He Ser 
195 200 205 

gee att ggc ctg gac tea ctt etc ate tec tte tea tat ctg ctt att 672 
Ala He Gly Leu Asp Ser Leu Leu He Ser Phe Ser Tyr Leu Leu He 
210 215 220 

ctt aag act gtg ttg ggc ttg aca cgt gaa gcc eag gee aag gca ttt 720 
Leu Lys Thr Val Leu Gly Leu Thr Arg Glu Ala Gin Ala Lys Ala Phe 
225 230 235 240 

ggc act tgc gtc tct cat gtg tgt get gtg ttc ata ttc tat gta cct 768 
Gly Thr Cys Val Ser His Val Cys Ala Val Phe He Phe Tyr Val Pro 
245 250 255 

tte att gga ttg tec atg gtg cat cgc ttt age aag egg cgt gac tct 816 
Phe He Gly Leu Ser Met Val His Arg Phe Ser Lys Arg Arg Asp Ser 
260 265 270 

ccg ctg cee gte ate ttg gcc aat ate tat etg etg gtt cct cct gtg 864 
Pro Leu Pro Val He Leu Ala Asn He Tyr Leu Leu Val Pro Pro Val 
275 280 285 

etc aac cca att gte tat gga gtg aag aca aag gag att cga cag cgc 912 
Leu Asn Pro He Val Tyr Gly Val Lys Thr Lys Glu He Arg Gin Arg 
290 295 300
atc ctt cga ctc ttc cat gtg gcc aca cac gct tca gag ccc tag 957
Ile Leu Arg Leu Phe His Val Ala Thr His Ala Ser Glu Pro
305 310 315 

It will be appreciated that SEQ ID NO: 10 contains methionine
residues at positions 1 and 2. Translation of the relevant mRNA sequences may occur
beginning from either or both methionines, which can be determined for a particular
cell source by purifying expressed CON198 protein and performing amino-terminal
sequencing thereon. CON198 polypeptides beginning at either Met1 or Met2 of SEQ
ID NO: 10 are intended as polypeptides of the invention.
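
The two candidate amino termini can be enumerated with a short Python sketch, offered for illustration only. The function names `met_start_variants` and `match_n_terminus` are hypothetical, the input is limited to the first sixteen residues of the CON198 listing, and the sketch makes the simplifying assumption that the initiator methionine is retained on the mature protein.

```python
from typing import Dict, List


def met_start_variants(protein: str, max_position: int = 2) -> Dict[int, str]:
    """Map each Met found within the first `max_position` residues
    (1-based) to the polypeptide that begins at that Met."""
    limit = min(max_position, len(protein))
    return {pos: protein[pos - 1:]
            for pos in range(1, limit + 1)
            if protein[pos - 1] == "M"}


def match_n_terminus(observed: str, variants: Dict[int, str]) -> List[int]:
    """Return the start positions whose sequence begins with the residues
    observed by amino-terminal sequencing."""
    return [pos for pos, seq in variants.items() if seq.startswith(observed)]


n_terminal = "MMVDPNGNESSATYFI"  # first 16 residues of the CON198 listing
variants = met_start_variants(n_terminal)
print(variants)
print(match_n_terminus("MVDP", variants))  # consistent with a Met2 start only
print(match_n_terminus("MMVD", variants))  # consistent with a Met1 start only
```

Because the Met1 form begins Met-Met and the Met2 form begins Met-Val, the first two sequenced residues suffice, under the stated assumption, to distinguish the two forms.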

Another preferred embodiment comprises a purified and isolated
polypeptide designated CON197, comprising the complete amino acid sequence set
forth in SEQ ID NO: 12. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON197 (SEQ ID NO: 11), as set forth below:

1
ATGGAAAGCGAGAACAGAAGAGTGATAAGAGAATTCATCCTCCTTGGTCTGACCCAGTCTCAAGATATT
MESENRRVIREFILLGLTQSQDI

70
CAGCTCCTGGTCTTTGTGCTAGTTTTAATATTCTACTTCATCATCCTCCCTGGAAATTTTCTCATTATT
QLLVFVLVLIFYFIILPGNFLII

139
TTCACCATAAAGTCAGACCCTGGGCTCACAGCCCCCCTCTATTTCTTTCTGGGCAACTTGGCCTTCCTG
FTIKSDPGLTAPLYFFLGNLAFL

208
GATGCATCCTACTCCTTCATTGTGGCTCCCCGGATGTTGGTGGACTTCCTCTCTGCGAAGAAGATAATC
DASYSFIVAPRMLVDFLSAKKII

277
TCCTACAGAGGCTGCATCACTCAGCTCTTTTTCTTGCACTTCCTTGGAGGAGGGGAGGGATTACTCCTT
SYRGCITQLFFLHFLGGGEGLLL

346
GTTGTGATGGCCTTTGACCGCTACATCGCCATCTGCCGGCCTCTGCACTATCCTACTGTCATGAACCCT
VVMAFDRYIAICRPLHYPTVMNP

415
AGAACCTGCTATGCAATGATGTTGGCTCTGTGGCTTGGGGGTTTTGTCCACTCCATTATCCAGGTGGTC
RTCYAMMLALWLGGFVHSIIQVV

484
CTCATCCTCCGCTTGCCTTTTTGTGGCCCAAACCAGCTGGACAACTTCTTCTGTGATGTCCCACAGGTC
LILRLPFCGPNQLDNFFCDVPQV

553
ATCAAGCTGGCCTGCACCGACACATTTGTGGTGGAGCTTCTGATGGTCTTCAACAGTGGCCTGATGACA
IKLACTDTFVVELLMVFNSGLMT

622
CTCCTGTGCTTTCTGGGGCTTCTGGCCTCCTATGCAGTCATTCTTTGTCGCATACGAGGGTCTTCTTCT
LLCFLGLLASYAVILCRIRGSSS

691
GAGGCAAAAAACAAGGCCATGTCCACGTGCATCACCCATATCATTGTTATATTCTTCATGTTTGGACCT
EAKNKAMSTCITHIIVIFFMFGP

760
GGCATCTTCATCTACACGCGCCCCTTCAGGGCTTTCCCAGCTGACAAGGTGGTTTCTCTCTTCCACACA
GIFIYTRPFRAFPADKVVSLFHT

829
GTGATTTTTCCTTTGTTGAATCCTGTCATTTATACCCTTCGCAACCAGGAAGTGAAAGCTTCCATGAAA
VIFPLLNPVIYTLRNQEVKASMK

898
AAGGTGTTTAATAAGCACATAGCCTGAAAAAGGGCGCAAAAAAAAAAAGAATAAAAATAGACTGTAGAA
KVFNKHIA*

967
TTTTTAAAAAAAAAAAAAAAAAAAAAAAA
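
The position labels in the CON197 listing (1, 70, 139, and so on) advance by 69 nucleotides, that is, 23 codons per printed line. The following Python sketch, provided for illustration only, reproduces those labels and converts a nucleotide position into a residue number. The names `LINE_WIDTH`, `line_starts` and `residue_at` are hypothetical, and the sketch assumes that the open reading frame begins at nucleotide 1, as the ATG at the start of the listing suggests.

```python
from typing import List

LINE_WIDTH = 69  # nucleotides per printed line, as observed in the listing


def line_starts(n_lines: int, width: int = LINE_WIDTH) -> List[int]:
    """1-based nucleotide position at which each printed line begins."""
    return [1 + width * k for k in range(n_lines)]


def residue_at(nt_position: int, orf_start: int = 1) -> int:
    """1-based residue number whose codon contains `nt_position`,
    assuming the reading frame starts at `orf_start`."""
    if nt_position < orf_start:
        raise ValueError("position lies upstream of the reading frame")
    return (nt_position - orf_start) // 3 + 1


print(line_starts(15))   # labels of the fifteen printed lines
print(residue_at(70))    # first residue on the second line
print(residue_at(898))   # first residue on the line that contains the stop codon
```

Under the stated assumption, the line labelled 898 begins at residue 300, so the eight residues printed before the stop codon on that line are residues 300 through 307.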

Another preferred embodiment comprises a purified and isolated
polypeptide designated CON202, comprising the complete amino acid sequence set
forth in SEQ ID NO: 14. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON202 (SEQ ID NO: 13), as set forth below:

1
TGCTTCCCCATAAGGTAACAGCTTTGTTAGCNCTGTCTGACATCATTGCTTGTTNACTTAAGAACTGAT

70

139
CTGCAGATCAGATCAGTTCTCTTTGTGGATTATATTTTCAGTAAAATGTATGGATCTATCTTTTCCTTG

208
TTCTTATATCTAGATCATGAGACTTGACTGAGGCTGTATCCTTATCCTCCATCCATCTATGGCGAACTA
MANY

277
TAGCCATGCAGCTGACAACATTTTGCAAAATCTCTCGCCTCTAACAGCCTTTCTGAAACTGACTTCCTT
SHAADNILQNLSPLTAFLKLTSL

346
GGGTTTCATAATAGGAGTCAGCGTGGTGGGCAACCTCCTGATCTCCATTTTGCTAGTGAAAGATAAGAC
GFIIGVSVVGNLLISILLVKDKT

415
CTTGCATAGAGCACCTTACTACTTCCTGTTGGATCTTTGCTGTTCAGATATCCTCAGATCTGCAATTTG
LHRAPYYFLLDLCCSDILRSAIC

484
TTTCCCATTTGTGTTCAACTCTGTCAAAAATGGTTCTACCTGGACTTATGGGACTCTGACTTGCAAAGT
FPFVFNSVKNGSTWTYGTLTCKV

553
GATTGCCTTTCTGGGGGTTTTGTCCTGTTTCCACACTGCTTTCATGCTCTTCTGCATCAGTGTCACCAG
IAFLGVLSCFHTAFMLFCISVTR

622
ATATTTAGCTATCGCCCATCACCGCTTCTATACAAAGAGGCTGACCTTTTGGACGTGTCTGGCTGTGAT
YLAIAHHRFYTKRLTFWTCLAVI

691
CTGTATGGTGTGGACTCTGTCTGTGGCCATGGCATTTCCCCCGGTTTTAGACGTGGGCACTTACTCATT
CMVWTLSVAMAFPPVLDVGTYSF

760
CATTAGGGAGGAAGATCAATGCACCTTCCAACACCGCTCCTTCAGGGCTAATGATTCCTTAGAATTTAT
IREEDQCTFQHRSFRANDSLGFM

829
GCTGCTTCTTGCTCTCATCCTCCTAGCCACACAGCTTGTCTACCTCAAGCTGATATTTTTCGTCCACGA
LLLALILLATQLVYLKLIFFVHD

898
TCGAAGAAAAATGAAGCCAGTCCAGTTTGTAGCAGCAGTCAGCCAGAACTGGACTTTTCATGGTCCTGG
RRKMKPVQFVAAVSQNWTFHGPG

967
AGCCAGTGGCCAGGCAGCTGCCAATTGGCTAGCAGGATTTGGAAGGGGTCCCACACCACCCACCTTGCT
ASGQAAANWLAGFGRGPTPPTLL

1036
GGGCATCAGGCAAAATGCAAACACCACAGGCAGAAGAAGGCTATTGGTCTTAGACGAGTTCAAAATGGA
GIRQNANTTGRRRLLVLDEFKME

1105
GAAAAGAATCAGCAGAATGTTCTATATAATGACTTTTCTGTTTCTAACCTTGTGGGGCCCCTACCTGGT
KRISRMFYIMTFLFLTLWGPYLV

1174
GGCCTGTTATTGGAGAGTTTTTGCAAGAGGGCCTGTAGTACCAGGGGGATTTCTAACAGCTGCTGTCTG
ACYWRVFARGPVVPGGFLTAAVW

1243
GATGAGTTTTGCCCAAGCAGGAATCAATCCTTTTGTCTGCATTTTCTCAAACAGGGAGCTGAGGCGCTG
MSFAQAGINPFVCIFSNRELRRC

1312
TTTCAGCACAACCCTTCTTTACTGCAGAAAATCCAGGTTACCAAGGGAACCTTACTGTGTTATATGAGG
FSTTLLYCRKSRLPREPYCVI

Still another preferred embodiment comprises a purified and isolated
polypeptide designated CON222, comprising the complete amino acid sequence set
forth in SEQ ID NO: 16. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON222 (SEQ ID NO: 15), as set forth below:

1 ATGTTTAGACCTCTTGTGAATCTCTCTCACATATATTTTAAGAAATTCCAGTACTGTGGGTATGCA
MFRPLVNLSHIYFKKFQYCGYA
67 CCACATGTTCGCAGCTGTAAACCAAACACTGATGGAATTTCATCTCTAGAGAATCTCTTGGCAAGC
PHVRSCKPNTDGISSLENLLAS
133 ATTATTCAGAGAGTATTTGTCTGGGTTGTATCTGCAGTTACCTGCTTTGGAAACATTTTTGTCATT
IIQRVFVWVVSAVTCFGNIFVI
199 TGCATGCGACCTTATATCAGGTCTGAGAACAAGCTGTATGCCATGTCAATCATTTCTCTCTGCTGT
CMRPYIRSENKLYAMSIISLCC
265 GCCGACTGCTTAATGGGAATATATTTATTCGTGATCGGAGGCTTTGACCTAAAGTTTCGTGGAGAA
ADCLMGIYLFVIGGFDLKFRGE
331 TACAATAAGCATGCGCAGCTGTGGATGGAGAGTACTCATTGTCAGCTTGTAGGATCTTTGGCCATT
YNKHAQLWMESTHCQLVGSLAI
397 CTGTCCACAGAAGTATCAGTTTTACTGTTAACATTTCTGACATTGGAAAAATACATCTGCATTGTC
LSTEVSVLLLTFLTLEKYICIV
463 TATCCTTTTAGATGTGTGAGACCTGGAAAATGCAGAACAATTACAGTTCTGATTCTCATTTGGATT
YPFRCVRPGKCRTITVLILIWI
529 ACTGGTTTTATAGTGGCTTTCATTCCATTGAGCAATAAGGAATTTTTCAAAAACTACTATGGCACC
TGFIVAFIPLSNKEFFKNYYGT
595 AATGGAGTATGCTTCCCTCTTCATTCAGAAGATACAGAAAGTATTGGAGCCCAGATTTATTCAGTG
NGVCFPLHSEDTESIGAQIYSV
661 GCAATTTTTCTTGGTATTAATTTGGCCGCATTTATCATCATAGTTTTTTCCTATGGAAGCATGTTT
AIFLGINLAAFIIIVFSYGSMF
727 TATAGTGTTCATCAAAGTGCCATAACAGCAACTGAAATACGGAATCAAGTTAAAAAAGAGATGATC
YSVHQSAITATEIRNQVKKEMI
793 CTTGCCAAACGTTTTTTCTTTATAGTATTTACTGATGCATTATGCTGGATACCCATTTTTGTAGTG
LAKRFFFIVFTDALCWIPIFVV
859 AAATTTCTTTCACTGCTTCAGGTAGAAATACCAGGTACCATAACCTCTTGGGTAGTGATTTTTATT
KFLSLLQVEIPGTITSWVVIFI
925 CTGCCCATTAACAGTGCTTTGAACCCAATTCTCTATACTCTGACCACAAGACCATTTAAAGAAATG
LPINSALNPILYTLTTRPFKEM
991 ATTCATCGGTTTTGGTATAACTACAGACAAAGAAAATCTATGGACAGCAAAGGTCAGAAAACATAT
IHRFWYNYRQRKSMDSKGQKTY
1057 GCTCCATCATTCATCTGGGTGGAAATGTGGCCACTGCAGGAGATGCCACCTGAGTTAATGAAGCCG
APSFIWVEMWPLQEMPPELMKP
1123 GACCTTTTCACATACCCCTGTGAAATGTCACTGATTTCTCAATCAACGAGACTCAATTCCTATTCA
DLFTYPCEMSLISQSTRLNSYS
1189 TGA 1191
*

Another preferred embodiment comprises a purified and isolated
polypeptide designated CON215, comprising the complete amino acid sequence set
forth in SEQ ID NO: 18. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON215 (SEQ ID NO: 17), as set forth below:

atg ggg ttc aac ttg acg ctt gca aaa tta cca aat aac gag ctg cac 48
Met Gly Phe Asn Leu Thr Leu Ala Lys Leu Pro Asn Asn Glu Leu His
1 5 10 15

ggc caa gag agt cac aat tca ggc aac agg agc gac ggg cca gga aag 96
Gly Gln Glu Ser His Asn Ser Gly Asn Arg Ser Asp Gly Pro Gly Lys
20 25 30

aac acc acc ctt cac aat gaa ttt gac aca att gtc ttg cca gtg ctt 144
Asn Thr Thr Leu His Asn Glu Phe Asp Thr Ile Val Leu Pro Val Leu
35 40 45

tat ctc att ata ttt gtg gca agc atc ttg ctg aat ggt tta gca gtg 192
Tyr Leu Ile Ile Phe Val Ala Ser Ile Leu Leu Asn Gly Leu Ala Val
50 55 60

tgg atc ttc ttc cac att agg aat aaa acc agc ttc ata ttc tat ctc 240
Trp Ile Phe Phe His Ile Arg Asn Lys Thr Ser Phe Ile Phe Tyr Leu
65 70 75 80

aaa aac ata gtg gtt gca gac ctc ata atg acg ctg aca ttt cca ttt 288
Lys Asn Ile Val Val Ala Asp Leu Ile Met Thr Leu Thr Phe Pro Phe
85 90 95

cga ata gtc cat gat gca gga ttt gga cct tgg tac ttc aag ttt att 336
Arg Ile Val His Asp Ala Gly Phe Gly Pro Trp Tyr Phe Lys Phe Ile
100 105 110

ctc tgc aga tac act tca gtt ttg ttt tat gca aac atg tat act tcc 384
Leu Cys Arg Tyr Thr Ser Val Leu Phe Tyr Ala Asn Met Tyr Thr Ser
115 120 125

atc gtg ttc ctt ggg ctg ata agc att gat cgc tat ctg aag gtg gtc 432
Ile Val Phe Leu Gly Leu Ile Ser Ile Asp Arg Tyr Leu Lys Val Val
130 135 140

aag cca ttt ggg gac tct cgg atg tac agc ata acc ttc acg aag gtt 480
Lys Pro Phe Gly Asp Ser Arg Met Tyr Ser Ile Thr Phe Thr Lys Val
145 150 155 160

tta tct gtt tgt gtt tgg gtg atc atg gct gtt ttg tct ttg cca aac 528
Leu Ser Val Cys Val Trp Val Ile Met Ala Val Leu Ser Leu Pro Asn
165 170 175

atc atc ctg aca aat ggt cag cca aca gag gac aat atc cat gac tgc 576
Ile Ile Leu Thr Asn Gly Gln Pro Thr Glu Asp Asn Ile His Asp Cys
180 185 190

tca aaa ctt aaa agt cct ttg ggg gtc aaa tgg cat acg gca gtc acc 624
Ser Lys Leu Lys Ser Pro Leu Gly Val Lys Trp His Thr Ala Val Thr
195 200 205

tat gtg aac agc tgc ttg ttt gtg gcc gtg ctg gtg att ctg atc gga 672
Tyr Val Asn Ser Cys Leu Phe Val Ala Val Leu Val Ile Leu Ile Gly
210 215 220

tgt tac ata gcc ata tcc agg tac atc cac aaa tcc agc agg caa ttc 720
Cys Tyr Ile Ala Ile Ser Arg Tyr Ile His Lys Ser Ser Arg Gln Phe
225 230 235 240

ata agt cag tca agc cga aag cga aaa cat aac cag agc atc agg gtt 768
Ile Ser Gln Ser Ser Arg Lys Arg Lys His Asn Gln Ser Ile Arg Val
245 250 255

gtt gtg gct gtg ttt ttt acc tgc ttt cta cca tat cac ttg tgc aga 816
Val Val Ala Val Phe Phe Thr Cys Phe Leu Pro Tyr His Leu Cys Arg
260 265 270

att cct ttt act ttt agt cac tta gac agg ctt tta gat gaa tct gca 864
Ile Pro Phe Thr Phe Ser His Leu Asp Arg Leu Leu Asp Glu Ser Ala
275 280 285

caa aaa atc cta tat tac tgc aaa gaa att aca ctt ttc ttg tct gcg 912
Gln Lys Ile Leu Tyr Tyr Cys Lys Glu Ile Thr Leu Phe Leu Ser Ala
290 295 300

tgt aat gtt tgc ctg gat cca ata att tac ttt ttc atg tgt agg tca 960
Cys Asn Val Cys Leu Asp Pro Ile Ile Tyr Phe Phe Met Cys Arg Ser
305 310 315 320

ttt tca aga agg ctg ttc aaa aaa tca aat atc aga acc agg agt gaa 1008
Phe Ser Arg Arg Leu Phe Lys Lys Ser Asn Ile Arg Thr Arg Ser Glu
325 330 335

agc atc aga tca ctg caa agt gtg aga aga tcg gaa gtt ctc ata tat 1056
Ser Ile Arg Ser Leu Gln Ser Val Arg Arg Ser Glu Val Leu Ile Tyr
340 345 350

tat gat tat act gat gtg tag 1077
Tyr Asp Tyr Thr Asp Val
355
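
By way of illustration only, the manner in which an amino acid sequence is deduced from a polynucleotide sequence can be sketched in Python. The sketch below translates the first six codons of the CON215 coding sequence shown above using the standard genetic code. The function name `translate` and the way the codon table is built are assumptions made for this example and form no part of the disclosure.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; amino acids are listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CON215 coding sequence (hypothetical usage).
print(translate("atg ggg ttc aac ttg acg"))  # MGFNLT
```

The one-letter output corresponds to Met Gly Phe Asn Leu Thr, the first six residues listed above.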

Another preferred embodiment comprises a purified and isolated
polypeptide designated CON217, comprising the complete amino acid sequence set
forth in SEQ ID NO: 20. This amino acid sequence was deduced from a
polynucleotide sequence encoding CON217 (SEQ ID NO: 19), as set forth below:

-41 C ATGGCATCCC CAGCCTAGCT CCCAATCCCA CTTTGGCACG

1 ATGTTAGCCAACAGCTCCTCAACCAACAGTTCTGTTCTCCCGTGTCCTGACTACCGACCTACCCAC
MLANSSSTNSSVLPCPDYRPTH
67 CGCCTGCACTTGGTGGTCTACAGCTTGGTGCTGGCTGCCGGGCTCCCCCTCAACGCGCTAGCCCTC
RLHLVVYSLVLAAGLPLNALAL
133 TGGGTCTTCCTGCGCGCGCTGCGCGTGCACTCGGTGGTGAGCGTGTACATGTGTAACCTGGCGGCC
WVFLRALRVHSVVSVYMCNLAA
199 AGCGACCTGCTCTTCACCCTCTCGCTGCCCGTTCGTCTCTCCTACTACGCACTGCACCACTGGCCC
SDLLFTLSLPVRLSYYALHHWP
265 TTCCCCGACCTCCTGTGCCAGACGACGGGCGCCATCTTCCAGATGAACATGTACGGCAGCTGCATC
FPDLLCQTTGAIFQMNMYGSCI
331 TTCCTGATGCTCATCAACGTGGACCGCTACGCCGCCATCGTGCACCCGCTGCGACTGCGCCACCTG
FLMLINVDRYAAIVHPLRLRHL
397 CGGCGGCCCCGCGTGGCGCGGCTGCTCTGCCTGGGCGTGTGGGCGCTCATCCTGGTGTTTGCCGTG
RRPRVARLLCLGVWALILVFAV
463 CCCGCCGCCCGCGTGCACAGGCCCTCGCGTTGCCGCTACCGGGACCTCGAGGTGCGCCTATGCTTC
PAARVHRPSRCRYRDLEVRLCF
529 GAGAGCTTCAGCGACGAGCTGTGGAAAGGCAGGCTGCTGCCCCTCGTGCTGCTGGCCGAGGCGCTG
ESFSDELWKGRLLPLVLLAEAL
595 GGCTTCCTGCTGCCCCTGGCGGCGGTGGTCTACTCGTCGGGCCGAGTCTTCTGGACGCTGGCGCGC
GFLLPLAAVVYSSGRVFWTLAR
661 CCCGACGCCACGCAGAGCCAGCGGCGGCGGAAGACCGTGCGCCTCCTGCTGGCTAACCTCGTCATC
PDATQSQRRRKTVRLLLANLVI
727 TTCCTGCTGTGCTTCGTGCCCTACAACAGCACGCTGGCGGTCTACGGGCTGCTGCGGAGCAAGCTG
FLLCFVPYNSTLAVYGLLRSKL
793 GTGGCGGCCAGCGTGCCTGCCCGCGATCGCGTGCGCGGGGTGCTGATGGTGATGGTGCTGCTGGCC
VAASVPARDRVRGVLMVMVLLA
859 GGCGCCAACTGCGTGCTGGACCCGCTGGTGTACTACTTTAGCCCCGAGGGCTTCCGCAACACCCTG
GANCVLDPLVYYFSAEGFRNTL
925 CGCGGCCTGGGCACTCCGCACCGGGCCAGGACCTCGGCCACCAACGGGACGCGGGCGGCGCTCGCG
RGLGTPHRARTSATNGTRAALA
991 CAATCCGAAAGGTCCGCCGTCACCACCGACGCCACCAGGCCGGATGCCGCCAGTCAGGGGCTGCTC
QSERSAVTTDATRPDAASQGLL
1057 CGACCCTCCGACTCCCACTCTCTGTCTTCCTTCACACAGTGTCCCCAGGATTCCGCCCTCTGAACA
RPSDSHSLSSFTQCPQDSAL*


1123 CACATGCCAT TGCGCTGTCC GTGCCCGACT CCCAACGCCT CTCGTTCTGG GAGGCTTACA
1183 GGGTGTACAC ACAAGAAGGT GGGCTGGGCA CTTGGACCTT TGGGTGGCAA TTCCAGCTTA
1243 GCAACGCAGA AGAGTACAAA GTGTGGAAGC CAGGGCCCAG GGAAGGCAGT GCTGCTGGAA
1303 ATGGCTTCTT TAAACTGTGA GCACGCAGAG CACCCCTTCT CCAGCGGTGG GAAGTGATGC
1363 AGAGAGCCCA CCCGTGCAGA GGGCAGAAGA GGACCAAATG CCTTTGGGTG GGCAGGGCAT
1423 TAAACTGCTA AAAGCTGGTT AGATGGAACA GAAAATGGGC ATTCTGGATC TAAACCGCCA
1483 CAGGGGCCTG AGAGCTGAAG AGCACCAGGT TTGGTGGACA AAGCTACTGA GATGCCTGTT
1543 CATQJGCTGA CTTCTGTCTA GGCTCATGGA TGCCACCCCC TTTCATTTCG GCCTAGGCTT
1603 CCCCTGCTCA CCACTGAGGC CTAATACAAG AGTTCCTATG GACAGAACTA CATTCTTTCT
1663 CGCATAGTGA CTTGTGACAA TTTAGACTTG GCATCCAGCA TGGGATAGTT GGGGCAAGGC
1723 AAAACTAACT TAGAGTTTCC CCCTCAACAA CATCCAAGTC CAAACCCTTT TTAGGTTATC
1783 CTTTCTTCCA TCACATCCCC TTTTCCAGGC CTCCTCCATT TTAGGTCCTT AATATTCTTT
1843 CTTTTTCTCT CTCTCTCGTT TCTCTCTTCT CTCTCCTCTC CTCTCCTCTC TCTTCTCCTC
1903 TTCTCTCTCT CTCCCTCTCT CTCCTTTGTC CAGAGTAAGG ATAAAATTCT TTCTACTAAA
1963 GCACTGGTTC TCAAACTTTT TGGTCTCAGA CCCCACTCTT AGAAATTGAG GATCTCAAAG
2023 AGCTTTGCTT ATATTTTGTT CTTTTGATAC TTACCATACT AGAAATTAAA GCGAATACAT
2083 TTTTAAAATA AATACACATG CACACATTAC ATTAGCCATG GGAGCAATAA TGTCACCACA
2143 CACACTTCAT GAAGCCTCTG GA7V7VACTCTA CAGTATACTT GTGAGAGAAT GAGAGTGAAA
2203 GGGACAAATA ACATCTGTGT AGCAGTATTA TGAAAATAGC TTGACCTTGT GGACTTCCTC
2263 AGAGGGTTGG TCCCTGGATC ACACTTTGAG AACCATACTT GTCCTGAAGT ATTGGAGTTC
2323 ATGTCTAACT TCTTCCCAGG GCATTATGTA CAGTGCTTTT TATTACTGTG GGGAGAGGGC
2383 AGTGCTAAAT AAATTAATCA CTACTGATAA AAAAAAAAAA AAAAAAAAAA AAAAAAA




Although SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, and 20 provide
particular human sequences, the invention is intended to include within its scope
other human allelic variants, non-human mammalian forms of GPCR polypeptides,
and other vertebrate forms of GPCR polypeptides.

It will be appreciated that extracellular epitopes are particularly useful
for generating and screening for antibodies and other binding compounds that bind to
receptors such as GPCR polypeptides. Thus, in another preferred embodiment, the
invention provides a purified and isolated polypeptide comprising at least one
extracellular domain of a GPCR polypeptide of the invention. By "extracellular
domain" is meant the amino-terminal extracellular domain or an extracellular loop
that spans two membrane domains.

A purified and isolated polypeptide comprising the N-terminal
extracellular domain of GPCR polypeptides of the invention is highly preferred. Also
preferred is a purified and isolated polypeptide comprising a GPCR seven
transmembrane receptor fragment selected from the group consisting of the
N-terminal extracellular domain of GPCR polypeptides of the invention,
transmembrane domains of GPCR polypeptides of the invention, extracellular loops
connecting transmembrane domains of GPCR polypeptides of the invention,
intracellular loops connecting transmembrane domains of GPCR polypeptides of the
invention, the C-terminal cytoplasmic domain of GPCR polypeptides, and fusions
thereof. Such fragments may be continuous portions of the native receptor. However,
it will also be appreciated that knowledge of the GPCR gene and protein sequences as
provided herein permits recombining of various domains that are not contiguous in
the native protein.

In another embodiment, the invention provides purified and isolated
polynucleotides (e.g., cDNA, genomic DNA, synthetic DNA, RNA, or combinations
thereof, single or double stranded) that comprise a nucleotide sequence encoding an
amino acid sequence of the polypeptides of the invention. Another embodiment
provides a purified and isolated polynucleotide encoding the amino acid sequence of
the polypeptide of the invention fused to a heterologous tag amino acid sequence.
Such polynucleotides are useful for recombinantly expressing the receptor and also for
detecting expression of the receptor in cells (e.g., using Northern hybridization and in
situ hybridization assays, and Western studies). Polynucleotides encoding
polypeptides of the invention also are useful to design antisense and other molecules
for the suppression of GPCR polypeptide expression in a cultured cell or tissue or in
an animal, for therapeutic purposes or to provide a model for diseases characterized
by aberrant GPCR polypeptide expression. Specifically
excluded from the definition of polynucleotides of the invention are entire isolated
chromosomes of native host cells. A preferred polynucleotide set forth in any one of
SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, and 19 corresponds to a naturally
occurring GPCR sequence. It will be appreciated that numerous other sequences
exist that also encode GPCR polypeptides having the amino acid sequence set out in
SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, and 20 due to the well-known degeneracy
of the universal genetic code. All such sequences represent polynucleotides of the
invention.
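
The degeneracy referred to above can be illustrated with a short calculation. The following Python sketch, offered as an example only, counts how many distinct coding sequences would encode a given peptide under the standard genetic code by multiplying the number of synonymous codons for each residue. The helper name `count_encodings` is an assumption made for this example; the six-residue peptide is the amino terminus of CON215 as set forth above.

```python
from collections import Counter
from itertools import product
from math import prod

BASES = "TCAG"
# Standard genetic code in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Number of synonymous codons available for each amino acid.
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the given peptide."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide.upper())


# Met(1) x Gly(4) x Phe(2) x Asn(2) x Leu(6) x Thr(4)
print(count_encodings("MGFNLT"))  # 384
```

Even for six residues there are several hundred alternative coding sequences, and the count grows multiplicatively with the length of the polypeptide.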

The invention also provides a purified and isolated polynucleotide
comprising a nucleotide sequence that encodes a mammalian seven transmembrane
receptor, wherein the polynucleotide hybridizes to a nucleotide sequence set forth in
any one of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 19 or the non-coding strand
complementary thereto, under the following hybridization conditions:

(a) hybridization for 16 hours at 42°C in a hybridization solution comprising
50% formamide, 1% SDS, 1 M NaCl, 10% Dextran sulphate; and

(b) washing 2 times for 30 minutes at 60°C in a wash solution comprising
0.1% SSC, 1% SDS. Polynucleotides that encode a human allelic variant are highly
preferred.
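
For illustration, the "non-coding strand complementary thereto" for any listed sequence can be derived computationally as its reverse complement. The Python sketch below is a minimal example under that assumption; the function name `reverse_complement` is hypothetical, and the input is the first twelve nucleotides of the CON215 coding sequence.

```python
# Watson-Crick base pairing, preserving upper or lower case.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


print(reverse_complement("ATGGGGTTCAAC"))  # GTTGAACCCCAT
```

Applying the function twice returns the original strand, which is a convenient sanity check when preparing probe sequences.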

A highly preferred polynucleotide of the invention comprises the
sequence set forth in SEQ ID NO: 1, which comprises a human CON193 encoding
DNA sequence:

ntggttgttg gaccattaaa atgcattatg gaatttttaa aagttggggg agagggagac 60 
agtaaaaata acctaUattt tctcttgttt tttttttttt aactctagga aagcccagac 120 
aaattttgag ctatttcata acctaccaga cttatcatgc taacactgaa taaaacagac 180 
ctaataccag cttcatttat tctgaatgga gccccaggac tggaagacac acaactctgg 240 
acttccttcc cattctgccc tatgtatgtt gtggctatgg tagggaattg tggactcctc 300 
tacctcattc actatgagga tgccctgcac aaacccatgt actacttctt ggccatgctt 360 
tcctttactg accttgttat gtgctctagt acaatcccta aagccctctg catcttctgg 420 
tttcabctca aggacabtgg atttgatgaa tgccttgtcc agatgttctt catccacacc 480 
ttcacaggga tggagtctgg ggtgcttatg cttatggccc tggatcgcta tgtggccatc 540 
tgctacccct tacgctattc aactatcctc accaatcctg taattgcaaa ggttgggact 600 
gccaccctcc tgagaggggt attactcatt attcccttta ctttcctcac caagcgcctg 660 
ccctcctgca gaggcaatat acctccccat acctactgtg accacatgtc tgtagccaaa 720 
ttgtcctgtg gtaatgtcaa ggtcaatgcc atctatggtc tgatggtLgc cctcctgatt 780 
gggggctttg acatactgtg tatcaccatc tcctatacca tgattctccg ggcagtggtc 840 
agcctctcct cagcagacgc tcggcagaag gcctttaata cctgcactgc ccacatttgt 900 
gccatcgttc tctcctatac tccagcttcc ttctccctct tttcccaccg ctttggggaa 960 
cacataaccc ccccttcttg ccacatcatt gtagccaata tttatctgct cctaccaccc 1020 
actatgaacc ctattgtcta tggggtgaaa accaaacaga tacgagactg tgtcataagg 1080
atccttccag gttctaagga taccaaaccc cacagcatgt gaatgaacac ttgccaggag 1140
tgagaagaga aggaaagaat tacttctatt tgcctcttat gcaggagttc abaaaatcct 1200
tctggaagta ctgtattgat cacaaaaugg agttcgntga ctggtgcatt ctcaataagt 1260
accttgggaa tctnacatca ctggaaggcc caccacactt ctataaat 1308

Also preferred is a polynucleotide comprising nucleotides 157-1119 of
SEQ ID NO: 1, which represent the portion of SEQ ID NO: 1 that encodes CON193
amino acids.

Another highly preferred polynucleotide of the invention comprises the
sequence set forth in SEQ ID NO: 3, which comprises a human CON166 encoding
DNA sequence:

atggatgaaa caggaaatct gacagtatct tctgccacat gccatgacac tattgatgac 60
ttccgcaatc aagtgtattc caccttgtac tctaLgatct ctgttgtagg cttctttggc 120
aatggctttg tgctctacgt cctcataaaa acctatcaca agaagtcagc cctccaagta 180
tacatgatta atttagcagt agcagatcta ctctgtgtgt gcacactgcc tctccgtgtg 240
gcccattatg ttcacaaagg catttggctc tttggtgact tcttgtgccg cctcagcacc 300
tatgctttgt atgtcaacct ctattgtagc atcttcttta tgacagccat gagctttttc 360
cggtgcattg caattgtttt tccagtccag aacattaatt tggttacaca gaaaaaagcc 420
aggtttgtgt gtgtaggtat ttggattttt gtgattttga ccagttctcc atttctaatg 480
gccaaaccac aaaaagatga gaaaaataat accaagtgct ttgagccccc acaagacaat 540
caaactaaaa atcatgtttc ggtctcgcat tatgtgccat tgtttgttgg ctttatcatc 600
ccttttgtta ttataattgt ctgttacaca atgatcattt tgaccttact aaaaaaatca 660
atgaaaaaaa atctgtcaag tcataaaaag gctataggaa tgatcatggt cgtgaccgct 720
gcctttttag tcagtttcat gccatatcat attcaacgta ccattcacct tcatttttta 780
cacaatgaaa ctaaaccctg tgattctgtc cttagaatgc agaagtccgt ggtcataacc 840
ttgtctctgg ctgcatccaa ttgttgcttt gaccctcccc tatatttctt ttctgggggt 900
aactttagga aaaggctgtc tacatttaga aagcattctt tgtccagcgt gacttatgta 960
cccagaaaga aggcctcttt gccagaaaaa ggagaagaaa tatgtaaagt atag 1014

The final three nucleotides of this sequence represent a stop codon. 
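
A statement of this kind can be checked mechanically. The Python sketch below, given as an example only, tests whether a coding sequence is a whole number of codons long and ends in one of the three standard stop codons. The function name `ends_with_stop` is an assumption, and the test string is a shortened toy sequence assembled from the first twelve and last six nucleotides of the listing above, not the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(dna: str) -> bool:
    """True if the sequence is codon-aligned and its last codon is a stop."""
    seq = "".join(dna.split()).upper()
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy sequence: atg gat gaa aca ... gta tag (illustrative only).
print(ends_with_stop("atggatgaaaca gtatag"))  # True
```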

Still another highly preferred polynucleotide of the invention
comprises the sequence set forth in SEQ ID NO: 5, which comprises a human
CON103 encoding DNA sequence:

ggggcctact tcaccgtgta cccggacttg ggaccatcac agacttcaga accatcagga 60
acctgggagc aactgaaagc tgaaccacag tgggctttca gacacacagc aggctgcgga 120
gcacaaacag gactggttcc ctccaggcca ccagcagggc ggcggaggtc ttcactgact 180
ccctgcctac ctctcaggac aatgtccttt tggctccaca gtccctgaag ccagagctgg 240
tgggggcagg gaggcagcca ccagcctcta tatgtagtgg aggagggggt gtccagggag 300
ggctgcatga tcctgagagc ccccacctca cccggctgga ctatcctccc acttcagggt 360
ttctctgggc ttccatcctg cccctgctga gccctgcttc ctcctctacc agcagcacaa 420
cccccaggct gggctcagag acctcatgtg gtgggatcac tcagtacccc gaggcggagg 480
gaaggaggga gggctgcagg gttccccttg gcctgcaaac aggaacacag ggtgtttctc 540
agtggctgcg agaatgctga tgaaaacccc aggatgttgt gtcaccgLgg tggccagctg 600
atagtgccaa tcatcccact ttgccccgag cactcctgca ggggtagaag actccagaac 660
ccuctctcag gcccatggcc caagcagccc atg gaa ctt cat aac ccg agc tct 714
cca tcc ccc tct ctc tcc tcc tct gtt ctc cct ccc tcc ttc tct ccc 762
tca ccc ccc tct gct ccc tct gcc ttt acc acc gtg ggg ggg tcc tct 810
gga ggg ccc tgc cac ccc acc tct tcc ccg ctg gtg tct gcc ttc ctg 858
gca cca atc ctg gcc ctg gag ttt gtc ctg ggc ctg gtg ggg aac agt 906
ttg gcc ctc ttc atc ttc tgc atc cac acg cgg ccc tgg acc tcc aac 954
acg gtg ttc ctg gtc agc ctg gtg gcc gct gac ttc ctc ctg atc agc 1002
aac ctg ccc ctc cgc gtg gac tac tac ctc ctc cat gag acc tgg cgc 1050
ttt ggg gct gct gcc tgc aaa gtc aac ctc ttc atg ccg tcc acc aac 1098
cgc acg gcc agc gtt gtc ttc ctc aca gcc atc gca ctc aac cgc tac 1146
ccg aag gtg gtg cag ccc cac cac gtg ctg agc cgt gct tcc gtg ggg 1194
gca gct gcc cgg gtg gcc ggg gga ctc tgg gtg ggc atc ctg ctc ctc 1242
aac ggg cac ctg ctc ctg agc acc ttc tcc ggc ccc tcc tgc ctc agc 1290
cac agg gtg ggc acg aag ccc tcg gcc tcg ctc cgc tgg cac cag gca 1338
ctg tac ctg ctg gag ttc ttc ctg cca ctg gcg ctc atc ctc ttt gct 1386
att gtg agc att ggg ctc acc atc cgg aac cgt ggt ctg ggc ggg cag 1434
gca ggc ccg cag agg gcc atg cgt gtg ctg gcc atg gtg gtg gcc gtc 1482
tac acc atc tgc ttc ttg ccc agc atc atc ttt ggc atg gct tcc atg 1530
gtg gct ttc tgg ctg tcc gcc tgc cga tcc ctg gac ctc tgc aca cag 1578
ctc ttc cat ggc tcc ctg gcc ttc acc tac ctc aac agt gtc ctg gac 1626
ccc gtg ctc tac tgc ttc tct agc ccc aac ttc ctc cac cag agc cgg 1674
gcc ttg ctg ggc ctc acg cgg ggc cgg cag ggc cca gtg agc gac gag 1722
agc tcc tac caa ccc tcc agg cag tgg cgc tac cgg gag gcc tct agg 1770
aag gcg gag gcc ata ggg aag ctg aaa gtg cag ggc gag gtc tct ctg 1818
gaa aag gaa ggc tcc tcc cag ggc tga gggccagctg cagggctgca 1865
gcgctgtggg ggtaagggct gccgcgctct ggcctggagg gacaaggcca gcacacggtg 1925
cctcaaccaa ctggacaagg gatggcggca gaccaggggc caggccaaag cactggcagg 1985
actcatgtgg gtggcaggga gagaaaccca cctaggcctc tcagtgtgtc caggatggca 2045
ttcccagaat gcaggggaga gcaggatgcc gggtggagga gacaggcaag gtgccgttgg 2105
cacaccagct cagacagggg cctgcgcagc tgcaggggac agacgccaat cactgtcaca 2165
gcagagtcac cttagaaatt ggacagctgc atgttctgtg ctctccagtt tgtcccttcc 2225
aatattaata aacttccctt ttaaatatat ttatttgcag accaatatct gtctttaatt 2285
ctaacctggg actgtcagta ggcgtcaaag tgagcgcccc agtgaaggaa ccttggagag 2345
agtgggagca ttcccagcct tccaggggga ctcgtcctcc agactttgga gcccgcatgt 2405
ctgaagcaga ctctttcttg gtag 2429

Also preferred is a polynucleotide comprising nucleotides 691-1842 of SEQ ID NO:
5, which represent the portion of SEQ ID NO: 5 that encodes CON103 amino acids.
Nucleotides 1843-1845 represent a stop codon.
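
The coordinates recited for a coding region can be checked arithmetically, since 1-based inclusive coordinates must span a whole number of codons and the stop codon occupies the next three positions. The Python sketch below is an illustrative example; the function name `coding_region_summary` is an assumption, and the coordinates are those recited above for SEQ ID NO: 5.

```python
def coding_region_summary(start: int, end: int) -> tuple[int, int, tuple[int, int]]:
    """Summarize a coding region given 1-based inclusive coordinates.

    Returns (length in nucleotides, number of codons,
    (first, last) position of the codon that immediately follows).
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("coding region is not a whole number of codons")
    return length, length // 3, (end + 1, end + 3)


# Nucleotides 691-1842 of SEQ ID NO: 5 (CON103).
print(coding_region_summary(691, 1842))  # (1152, 384, (1843, 1845))
```

The same check applied to the other recited ranges (157-1119, 146-1144, and 266-1375) also yields whole numbers of codons.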

Another highly preferred polynucleotide of the invention comprises the 
sequence set forth in SEQ ID NO: 7, which comprises a CON203-encoding DNA 
sequence: 

ttgaatttag gtgacactat agaagagcta tgacgtcgca tgcacgcgta cgtaagctcg 60
gaattcggct cgagctgaac taatgactgc cgccataaga agacagagag aactgagtat 120
cctcccaaag gtgacacLgg aagcaatgaa caccacagtg atgcaaggct tcaacagatc 180
tgagcggtgc cccagagaca cccggatagt acagctggca tccccagccc cctacacagc 240
ggtttucttg accggcatcc tgccgaatac tttggctctg tgggtgtttg ttcacatccc 300
cagctcctcc acctccatca tctacctcaa aaacactttg gtggccgaci tgacaatgac 360
acccacgctt cctttcaaaa tcctctctga ctcacacctg gcaccctggc agctcagagc 420
ccccgcgtgt cgcttcuctc cggtgatatt ttatgagacc atgcacgtgg gcatcgtgct 480 
gtcagggctc atagcctttg acagattcct caagatcatc agacctttga gaaatatttt 540 
tccaaaaaaa cctgtttttg caaaaacggt ctcaatcttc atctgggtct ttttggtctt 600 
catctccctg ccaaatatga tcttgagcaa caaggaagca acaccatcgt ctgtgaaaaa 660 
gcgcgcttcc tcaaaggggc ctctggggct gaaatggcat caaatggtaa ataacatatg 720 
ccagtctatt ttctggactg gttttatcct aatgcttgtg ttctacgtgg ttatcgcaaa 780 
aaaagcacat gattcctata gaaagtccaa aagtaaggac agaaaaaaca acaaaaagct 840 
ggaaggcaaa gtatttgttg tcgtggctgt cttctttgtg tgttttgctc catttcattt 900 
tgccagagtt ccatatactc acagtcaaac caacaataag actgactgta gactgcaaaa 960 
tcaactigttc actgctaaag aaacaactct ctttttggca gcaaccaaca cttgtatgga 1020 
tcccttaata tacatattcc tatgtaaaaa attcacagaa aagctaccat gtacgcaagg 1080 
gagaaagacc acagcatcaa gccaagaaaa tcatagcagt cagacagaca acacaacctt 1140 
aggctgacaa ctgtacatag ggttaacctc tatttattga tgagacttcc gtagataatg 1200 
tggaaatcaa atttaaccaa gaaaaaaaga ttggaacaaa tgctcbctta cattttattt 1260 
atcctggtgt ccaggaaaag attatattaa atttaaatcc acatagatct attcataagc 1320 
tgaatgaacc attacctaag agaatgcaac aggataccaa tggccactag aggcatattc 1380 
cttcttcttt tttttttgtt aaatttcaag agcattcact ttacatttgg aaagactaag 1440 
gggaacggtt atcctacaaa cctcccttca acacctttta catt 1484 

Also preferred is a polynucleotide comprising nucleotides 146-1144 of SEQ ID NO:
7, which represent the portion of SEQ ID NO: 7 that encodes CON203 amino acids.
Nucleotides 1145-1147 represent a stop codon.

Another highly preferred polynucleotide of the invention comprises the
sequence set forth in SEQ ID NO: 9, which comprises a human CON198 encoding
DNA sequence:

ATGATGGTGG ATCCCAATGG CAATGAATCC AGTGCTACAT ACTTCATCCT AATAGGCCTC 60
CCTGGTTTAG AAGAGGCTCA GTTCTGGTTG GCCTTCCCAT TGTGCTCCCT CTACCTTATT 120
GCTGTGCTAG GTAACTTGAC AATCATCTAC ATTGTGCGGA CTGAGCACAG CCTGCATGAG 180
CCCATGTATA TATTTCTTTG CATGCTTTCA GGCATTGACA TCCTCATCTC CACCTCATCC 240
ATGCCCAAAA TGCTGGCCAT CTTCTGGTTC AATTCCACTA CCATCCAGTT TGATGCTTGT 300
CTGCTACAGA TGTTTGCCAT CCACTCCTTA TCTGGCATGG AATCCACAGT GCTGCTGGCC 360
ATGGCTTTTG ACCGCTATGT GGCCATCTGT CACCCACTGC GCCATGCCAC AGTACTTACG 420
TTGCCTCGTG TCACCAAAAT TGGTGTGGCT GCTGTGGTGC GGGGGGCTGC ACTGATGGCA 480
CCCCTTCCTG TCTTCATCAA GCAGCTGCCC TTCTGCCGCT CCAATATCCT TTCCCATTCC 540
TACTGCCTAC ACCAAGATGT CATGAAGCTG GCCTCTGATG ATATCCGGGT CAATGTCGTC 600
TATGGCCTTA TCGTCATCAT CTCCGCCATT GGCCTGGACT CACTTCTCAT CTCCTTCTCA 660
TATCTGCTTA TTCTTAAGAC TGTGTTGGGC TTGACACGTG AAGCCCAGGC CAAGGCATTT 720
GGCACTTGCG TCTCTCATGT GTGTGCTGTG TTCATATTCT ATGTACCTTT CATTGGATTG 780
TCCATGGTGC ATCGCTTTAG CAAGCGGCGT GACTCTCCGC TGCCCGTCAT CTTGGCCAAT 840
ATCTATCTGC TGGTTCCTCC TGTGCTCAAC CCAATTGTCT ATGGAGTGAA GACAAAGGAG 900
ATTCGACAGC GCATCCTTCG ACTTTTCCAT GTGGCCACAC ACGCTTCAGA GCCCTAG 957

The last three nucleotides of this sequence represent a stop codon.

Still another highly preferred polynucleotide of the invention
comprises the sequence set forth in SEQ ID NO: 11, which comprises a human
CON197 encoding DNA sequence:



ATGGAAAGCG AGAACAGAAG AGTGATAAGA GAATTCATCC TCCTTGGTCT GACCCAGTCT 60
CAAGATATTC AGCTCCTGGT CTTTGTGCTA GTTTTAATAT TCTACTTCAT CATCCTCCCT 120
GGAAATTTTC TCATTATTTT CACCATAAAG TCAGACCCTG GGCTCACAGC CCCCCTCTAT 180
TTCTTTCTGG GCAACTTGGC CTTCCTGGAT GCATCCTACT CCTTCATTGT GGCTCCCCGG 240
ATGTTGGTGG ACTTCCTCTC TGCGAAGAAG ATAATCTCCT ACAGAGGCTG CATCACTCAG 300
CTCTTTTTCT TGCACTTCCT TGGAGGAGGG GAGGGATTAC TCCTTGTTGT GATGGCCTTT 360
GACCGCTACA TCGCCATCTG CCGGCCTCTG CACTATCCTA CTGTCATGAA CCCTAGAACC 420
TGCTATGCAA TGATGTTGGC TCTGTGGCTT GGGGGTTTTG TCCACTCCAT TATCCAGGTG 480
GTCCTCATCC TCCGCTTGCC TTTTTGTGGC CCAAACCAGC TGGACAACTT CTTCTGTGAT 540
GTCCCACAGG TCATCAAGCT GGCCTGCACC GACACATTTG TGGTGGAGCT TCTGATGGTC 600
TTCAACAGTG GCCTGATGAC ACTCCTGTGC TTTCTGGGGC TTCTGGCCTC CTATGCAGTC 660
ATTCTTTGTC GCATACGAGG GTCTTCTTCT GAGGCAAAAA ACAAGGCCAT GTCCACGTGC 720
ATCACCCATA TCATTGTTAT ATTCTTCATG TTTGGACCTG GCATCTTCAT CTACACGCGC 780
CCCTTCAGGG CTTTCCCAGC TGACAAGGTG GTTTCTCTCT TCCACACAGT GATTTTTCCT 840
TTGTTGAATC CTGTCATTTA TACCCTTCGC AACCAGGAAG TGAAAGCTTC CATGAAAAAG 900
GTGTTTAATA AGCACATAGC CTGA 924

The last three nucleotides of this sequence represent a stop codon. 

Another highly preferred polynucleotide of the invention comprises the 
sequence set forth in SEQ ID NO: 13, which comprises a human CON202 encoding 
DNA sequence: 

1 TGCTTCCCCA TAAGGTAACA GCTTTGTTAG CNCTGTCTGA CATCATTGCT
51 TGTTWACTTA AGAACTGATA GGTYTTTTTT TTTTTTTTTT TTCAGATATT
101 CTGATGGCAA AACAAGTGGA AGAAAAGAGG AAGCATGACT GCAGATCAGA
151 TCAGTTCTCT TTGTGGATTA TATTTTCAGT AAAATGTATG GATCTATCTT
201 TTCCTTGTTC TTATATCTAG ATCATGAGAC TTGACTGAGG CTGTATCCTT
251 ATCCTCCATC CATCTATGGC GAACTATAGC CATGCAGCTG ACAACATTTT
301 GCAAAATCTC TCGCCTCTAA CAGCCTTTCT GAAACTGACT TCCTTGGGTT
351 TCATAATAGG AGTCAGCGTG GTGGGCAACC TCCTGATCTC CATTTTGCTA
401 GTGAAAGATA AGACCTTGCA TAGAGCACCT TACTACTTCC TGTTGGATCT
451 TTGCTGTTCA GATATCCTCA GATCTGCAAT TTGTTTCCCA TTTGTGTTCA
501 ACTCTGTCAA AAATGGTTCT ACCTGGACTT ATGGGACTCT GACTTGCAAA
551 GTGATTGCCT TTCTGGGGGT TTTGTCCTGT TTCCACACTG CTTTCATGCT
601 CTTCTGCATC AGTGTCACCA GATATTTAGC TATCGCCCAT CACCGCTTCT
651 ATACAAAGAG GCTGACCTTT TGGACGTGTC TGGCTGTGAT CTGTATGGTG
701 TGGACTCTGT CTGTGGCCAT GGCATTTCCC CCGGTTTTAG ACGTGGGCAC
751 TTACTCATTC ATTAGGGAGG AAGATCAATG CACCTTCCAA CACCGCTCCT
801 TCAGGGCTAA TGATTCCTTA GGATTTATGC TGCTTCTTGC TCTCATCCTC
851 CTAGCCACAC AGCTTGTCTA CCTCAAGCTG ATATTTTTCG TCCACGATCG
901 AAGAAAAATG AAGCCAGTCC AGTTTGTAGC AGCAGTCAGC CAGAACTGGA
951 CTTTTCATGG TCCTGGAGCC AGTGGCCAGG CAGCTGCCAA TTGGCTAGCA
1001 GGATTTGGAA GGGGTCCCAC ACCACCCACC TTGCTGGGCA TCAGGCAAAA
1051 TGCAAACACC ACAGGCAGAA GAAGGCTATT GGTCTTAGAC GAGTTCAAAA
1101 TGGAGAAAAG AATCAGCAGA ATGTTCTATA TAATGACTTT TCTGTTTCTA
1151 ACCTTGTGGG GCCCCTACCT GGTGGCCTGT TATTGGAGAG TTTTTGCAAG
1201 AGGGCCTGTA GTACCAGGGG GATTTCTAAC AGCTGCTGTC TGGATGAGTT
1251 TTGCCCAAGC AGGAATCAAT CCTTTTGTCT GCATTTTCTC AAACAGGGAG
1301 CTGAGGCGCT GTTTCAGCAC AACCCTTCTT TACTGCAGAA AATCCAGGTT
1351 ACCAAGGGAA CCTTACTGTG TTATATGAGG

Also preferred is a polynucleotide comprising nucleotides 266-1375 of SEQ ID NO:
13, which represent the portion of SEQ ID NO: 13 that encodes CON202 amino acids.
Nucleotides 1376-1378 represent a stop codon.

Another highly preferred polynucleotide of the invention comprises the
sequence set forth in SEQ ID NO: 15, which comprises a human CON222 encoding
DNA sequence:

1 ATGTTTAGAC CTCTTGTGAA TCTCTCTCAC ATATATTTTA AGAAATTCCA
51 GTACTGTGGG TATGCACCAC ATGTTCGCAG CTGTAAACCA AACACTGATG
101 GAATTTCATC TCTAGAGAAT CTCTTGGCAA GCATTATTCA GAGAGTATTT
151 GTCTGGGTTG TATCTGCAGT TACCTGCTTT GGAAACATTT TTGTCATTTG
201 GATGCGACCT TATATCAGGT CTGAGAACAA GCTGTATGCC ATGTCAATCA
251 TTTCTCTCTG CTGTGCCGAC TGCTTAATGG GAATATATTT ATTCGTGATC
301 GGAGGCTTTG ACCTAAAGTT TCGTGGAGAA TACAATAAGC ATGCGCAGCT
351 GTGGATGGAG AGTACTCATT GTCAGCTTGT AGGATCTTTG GCCATTCTGT
401 CCACAGAAGT ATCAGTTTTA CTGTTAACAT TTCTGACATT GGAAAAATAC
451 ATCTGCATTG TCTATCCTTT TAGATGTGTG AGACCTGGAA AATGCAGAAC
501 AATTACAGTT CTGATTCTCA TTTGGATTAC TGGTTTTATA GTGGCTTTCA
551 TTCCATTGAG CAATAAGGAA TTTTTCAAAA ACTACTATGG CACCAATGGA
601 GTATGCTTCC CTCTTCATTC AGAAGATACA GAAAGTATTG GAGCCCAGAT
651 TTATTCAGTG GCAATTTTTC TTGGTATTAA TTTGGCCGCA TTTATCATCA
701 TAGTTTTTTC CTATGGAAGC ATGTTTTATA GTGTTCATCA AAGTGCCATA
751 ACAGCAACTG AAATACGGAA TCAAGTTAAA AAAGAGATGA TCCTTGCCAA
801 ACGTTTTTTC TTTATAGTAT TTACTGATGC ATTATGCTGG ATACCCATTT
851 TTGTAGTGAA ATTTCTTTCA CTGCTTCAGG TAGAAATACC AGGTACCATA
901 ACCTCTTGGG TAGTGATTTT TATTCTGCCC ATTAACAGTG CTTTGAACCC
951 AATTCTCTAT ACTCTGACCA CAAGACCATT TAAAGAAATG ATTCATCGGT
1001 TTTGGTATAA CTACAGACAA AGAAAATCTA TGGACAGCAA AGGTCAGAAA
1051 ACATATGCTC CATCATTCAT CTGGGTGGAA ATGTGGCCAC TGCAGGAGAT
1101 GCCACCTGAG TTAATGAAGC CGGACCTTTT CACATACCCC TGTGAAATGT
1151 CACTGATTTC TCAATCAACG AGACTCAATT CCTATTCA

The last three nucleotides of this sequence represent a stop codon. 

Another highly preferred polynucleotide of the invention comprises the
sequence set forth in SEQ ID NO: 17, which comprises a human CON215 encoding
DNA sequence. Also preferred is a polynucleotide comprising the portion of SEQ ID
NO: 17 set forth below, which represents the portion of SEQ ID NO: 17 that encodes
CON215 amino acids (the last three nucleotides represent a stop codon).



ATGGGGTTCA ACTTGACGCT TGCAAAATTA CCAAATAACG AGCTGCACGG CCAAGAGAGT 60
CACAATTCAG GCAACAGGAG CGACGGGCCA GGAAAGAACA CCACCCTTCA CAATGAATTT 120
GACACAATTG TCTTGCCAGT GCTTTATCTC ATTATATTTG TGGCAAGCAT CTTGCTGAAT 180
GGTTTAGCAG TGTGGATCTT CTTCCACATT AGGAATAAAA CCAGCTTCAT ATTCTATCTC 240
AAAAACATAG TGGTTGCAGA CCTCATAATG ACGCTGACAT TTCCATTTCG AATAGTCCAT 300
GATGCAGGAT TTGGACCTTG GTACTTCAAG TTTATTCTCT GCAGATACAC TTCAGTTTTG 360
TTTTATGCAA ACATGTATAC TTCCATCGTG TTCCTTGGGC TGATAAGCAT TGATCGCTAT 420
CTGAAGGTGG TCAAGCCATT TGGGGACTCT CGGATGTACA GCATAACCTT CACGAAGGTT 480
TTATCTGTTT GTGTTTGGGT GATCATGGCT GTTTTGTCTT TGCCAAACAT CATCCTGACA 540
AATGGTCAGC CAACAGAGGA CAATATCCAT GACTGCTCAA AACTTAAAAG TCCTTTGGGG 600
GTCAAATGGC ATACGGCAGT CACCTATGTG AACAGCTGCT TGTTTGTGGC CGTGCTGGTG 660
ATTCTGATCG GATGTTACAT AGCCATATCC AGGTACATCC ACAAATCCAG CAGGCAATTC 720
ATAAGTCAGT CAAGCCGAAA GCGAAAACAT AACCAGAGCA TCAGGGTTGT TGTGGCTGTG 780
TTTTTTACCT GCTTTCTACC ATATCACTTG TGCAGAATTC CTTTTACTTT TAGTCACTTA 840
GACAGGCTTT TAGATGAATC TGCACAAAAA ATCCTATATT ACTGCAAAGA AATTACACTT 900
TTCTTGTCTG CGTGTAATGT TTGCCTGGAT CC7\ATAATTT ACTTTTTCAT GTGTAGGTCA 960
TTTTCAAGAA GGCTGTTCAA AAAATCAAAT ATCAGAACCA GGAGTGAAAG CATCAGATCA 1020
CTGCAAAGTG TGAGAAGATC GGAAGTTCTC ATATATTATG ATTATACl'GA TGTGTAG 1077



Another preferred polynucleotide of the invention comprises the
portion of the sequence set forth in SEQ ID NO: 19 which comprises a human
CON217 encoding DNA sequence:



1 ATGTTAGCCA ACAGCTCCTC AACCAACAGT TCTGTTCTCC CGTGTCCTGA CTACCGACCT
61 ACCCACCGCC TGCACTTGGT GGTCTACAGC TTGGTGCTGG CTGCCGGGCT CCCCCTCAAC
121 GCGCTAGCCC TCTGGGTCTT CCTGCGCGCG CTGCGCGTGC ACTCGGTGGT GAGCGTGTAC
181 ATGTGTAACC TGGCGGCCAG CGACCTGCTC UTCACCCTCT CGCTGCCCGT TCGTCTCTCC
241 TACTACGCAC TGCACCACTG GCCCTTCCCC GACCTCCTGT GCCAGACGAC GGGCGCCATC
301 TTCCAGATGA ACATGTACGG CAGCTGCATC TTCCTGATGC TCATCAACGT GGACCGCTAC
361 GCCGCCATCG TGCACCCGCT GCGACTGCGC CACCTGCGGC GGCCCCGCGT GGCGCGGCTG
421 CTCTGCCTGG GCGTGTGGGC GCTCATCCTG GTGTTTGCCG TGCCCGCCGC CCGCGTGCAC
481 AGGCCCrCGC GTTGCCGCTA CCGGGACCTC GAGGTGCGCC TATGCTTCGA GAGCTTCAGC
541 GACGAGCTGT GGAAAGGCAG GCTGCTGCCC CTCGTGCTGC TGGCCGAGGC GCTGGGCTTC
601 CTGCTGCCCC TGGCGGCGGT GGTCTACTCG TCGGGCCGAG TCTTCTGGAC GCTGGCGCGC
661 CCCGACGCCA CGCAGAGCCA GCGGCGGCGG AAGACCGTGC GCCTCCTGCT GGCTAACCTC
721 GTCATCTTCC TGCTGTGCTT CGTGCCCTAC AACAGCACGC TGGCGGTCTA CGGGCTGCTG
781 CGGAGCAAGC TGGTGGCGGC CAGCGTGCCT GCCCGCGATC GCGTGCGCGG GGTGCTGATG
841 GTGATGGTGC TGCTGGCCGG CGCCAACTGC GTGCTGGACC CGCTGGTGTA CrACTTTAGC
901 CCCGAGGGCT TCCGCAACAC CCTGCGCGGC CTGGGCACTC CGCACCGGGC CAGGACCTCG
961 GCCACCAACG GGACGCGGGC GGCGCTCGCG CAATCCGAAA GGTCCGCCGT CACCACCGAC
1021 GCCACCAGGC CGGATGCCGC CAGTCAGGGG CTGCTCCGAC CCTCCGACTC CCACTCTCTG
1081 TCTTCCTTCA CACAGTGTCC CCAGGATTCC GCCCTCTGA

The last three nucleotides of this sequence represent a stop codon. 

The invention also includes polynucleotides differing from the
sequences set forth in SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17 and 19 and from their
complementary strand by at least one nucleotide.

In a related embodiment, the invention provides vectors comprising a
polynucleotide of the invention. Such vectors are useful, e.g., for amplifying the
polynucleotides in host cells to create useful quantities thereof. In preferred
embodiments, the vector is an expression vector wherein the polynucleotide of the
invention is operatively linked to a polynucleotide comprising an expression control
sequence. Such vectors are useful for recombinant production of polypeptides of the
invention.

In another related embodiment, the invention provides host cells that
are transformed or transfected (stably or transiently) with a polynucleotide of the
invention or vectors of the invention. As stated above, such host cells are useful for
amplifying the polynucleotides and also for expressing the GPCR seven
transmembrane receptor polypeptides or fragments thereof encoded by the
polynucleotides. Such host cells are useful in assays as described herein.

In still another related embodiment, the invention provides a method
for producing a seven transmembrane receptor polypeptide (or fragment thereof) of
the invention comprising the steps of growing a host cell of the invention in a nutrient
medium and isolating the polypeptide or variant thereof from the cell or the medium.
Since the GPCR polypeptides are seven transmembrane receptors, it will be
appreciated that, for some applications, such as certain activity assays, the preferable
isolation may involve isolation of cell membranes containing the polypeptide
embedded therein, whereas for other applications a more complete isolation may be
preferable.

In still another embodiment, the invention provides antibodies that are
specific for the GPCR seven transmembrane receptors of the invention. Antibody
specificity is described in greater detail below. However, it should be emphasized
that antibodies that can be generated from polypeptides that have previously been
described in the literature and that are capable of fortuitously cross-reacting with the
GPCR polypeptides of the invention (e.g., due to the fortuitous existence of a similar
epitope in both polypeptides) are considered "cross-reactive" antibodies. Such
cross-reactive antibodies are not antibodies that are "specific" for the GPCR polypeptides.
The determination of whether an antibody is specific for a GPCR polypeptide or is
cross-reactive with another known receptor is made using Western blotting assays or
several other assays well known in the literature. For identifying cells that express
GPCR polypeptides and also for modulating GPCR-ligand binding activity,
antibodies that specifically bind to an extracellular epitope of one of the GPCR seven
transmembrane receptors of the present invention are preferred.

In one preferred variation, the invention provides monoclonal
antibodies. Hybridomas that produce such antibodies also are intended as aspects of
the invention. In yet another variation, the invention provides a humanized antibody.
Humanized antibodies are useful for in vivo therapeutic indications.

In another variation, the invention provides a cell-free composition
comprising polyclonal antibodies, wherein at least one of the antibodies is an antibody
of the invention specific for a GPCR polypeptide of the present invention. Antiserum
isolated from an animal is an exemplary composition, as is a composition comprising
an antibody fraction of an antiserum that has been resuspended in water or in another
diluent, excipient, or carrier.

In still another related embodiment, the invention provides
anti-idiotypic antibodies specific for an antibody that is specific for a GPCR
polypeptide of the present invention.

It is well known that antibodies contain relatively small antigen
binding domains that can be isolated chemically or by recombinant techniques. Such
domains are useful GPCR binding molecules themselves, and also may be
reintroduced into human antibodies, or fused to toxins or other polypeptides. Thus, in
still another embodiment, the invention provides a polypeptide comprising a fragment
of a GPCR-specific antibody, wherein the fragment and the polypeptide bind to a
GPCR seven transmembrane receptor of the present invention. By way of
non-limiting example, the invention provides polypeptides that are single chain antibodies
and CDR-grafted antibodies.

Also within the scope of the invention are compositions comprising
polypeptides, polynucleotides, or antibodies of the invention that have been
formulated with, e.g., a pharmaceutically acceptable carrier.

The invention also provides methods of using antibodies of the
invention. For example, the invention provides a method for modulating ligand
binding of a GPCR seven transmembrane receptor of the present invention comprising
the step of contacting the seven transmembrane receptor with an antibody specific for
the seven transmembrane receptor, under conditions wherein the antibody binds the
receptor.

GPCR polypeptides are expressed in the brain, providing an indication
that aberrant GPCR polypeptide signaling activity may correlate with one or more
neurological disorders. The invention also provides a method for treating a
neurological disorder comprising the step of administering to a mammal in need of
such treatment an amount of an antibody-like polypeptide of the invention that is
sufficient to modulate ligand binding of a GPCR seven transmembrane receptor of the
present invention in neurons of the mammal. In addition to administration of
antibody-like polypeptides, administration of natural ligands for GPCR polypeptides
as well as modulators of GPCR polypeptide activity, such as small molecules that
mimic, agonize or antagonize ligand-mediated GPCR polypeptide signaling, are
contemplated. The expression pattern provides an indication that such molecules will
have utility for treating neurological and/or psychiatric diseases, including but not
limited to schizophrenia, depression, anxiety, bipolar disease, affective disorders,
attention deficit hyperactivity disorder/attention deficit disorder (ADHD/ADD),
epilepsy, neuritis, neurasthenia, neuropathy, neuroses, Alzheimer's disease,
Parkinson's disease, migraine, senile dementia, and the like. Treatment of individuals
having any of these disorders is contemplated as an aspect of the invention.

Thus, in yet another embodiment, the invention provides genetic
screening procedures that entail analyzing a person's genome - in particular their
alleles for GPCR's of the invention - to determine whether the individual possesses a
genetic characteristic found in other individuals that are considered to be afflicted
with, or at risk for, developing a mental disorder or disease of the brain that is
suspected of having a hereditary component. For example, in one embodiment, the
invention provides a method for determining a potential for developing a disorder
affecting the brain in a human subject comprising the steps of analyzing the coding
sequence of one or more GPCR genes from the human subject; and determining
development potential for the disorder in said human subject from the analyzing step.




More particularly, the invention provides a method of screening a
human subject to diagnose a disorder affecting the brain or genetic predisposition
therefor, comprising the steps of: (a) assaying nucleic acid of a human subject to
determine a presence or an absence of a mutation altering the amino acid sequence,
expression, or biological activity of at least one seven transmembrane receptor that is
expressed in the brain, wherein the seven transmembrane receptor comprises an
amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8,
10, 12, 14, 16, 18, and 20, or an allelic variant thereof, and wherein the nucleic acid
corresponds to the gene encoding the seven transmembrane receptor; and (b)
diagnosing the disorder or predisposition from the presence or absence of said
mutation, wherein the presence of a mutation altering the amino acid sequence,
expression, or biological activity of an allele in the nucleic acid correlates with an
increased risk of developing the disorder. In preferred variations, the seven
transmembrane receptor is CON202 comprising an amino acid sequence set forth in
SEQ ID NO: 14, or an allelic variant thereof, and the disease is schizophrenia.
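
Whether a nucleotide substitution alters the amino acid sequence, as in step (a), can be checked by translating the reference and subject coding sequences and comparing them codon by codon. The following is a minimal sketch under stated assumptions, not a method taken from this disclosure: the function names are hypothetical, the standard genetic code is assumed, and only equal-length sequences (substitutions, no additions or deletions) are handled. The short test sequences are made up for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence (length divisible by 3); '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def amino_acid_changes(reference_cds, subject_cds):
    """Return (codon number, reference residue, subject residue) for each altered codon."""
    if len(reference_cds) != len(subject_cds):
        raise ValueError("this sketch only handles substitutions")
    ref_protein = translate(reference_cds)
    subj_protein = translate(subject_cds)
    return [
        (number, ref_aa, subj_aa)
        for number, (ref_aa, subj_aa) in enumerate(zip(ref_protein, subj_protein), start=1)
        if ref_aa != subj_aa
    ]


reference = "ATGTTTAGACCTTGA"   # made-up example: M F R P stop
silent = "ATGTTCAGACCTTGA"      # TTT -> TTC, both phenylalanine
missense = "ATGTTTGGACCTTGA"    # AGA -> GGA, arginine -> glycine
print(amino_acid_changes(reference, silent))
print(amino_acid_changes(reference, missense))
```

A silent substitution yields an empty list, so it would not count as a mutation "altering the amino acid sequence", although it could still affect expression.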

By "human subject" is meant any human being, human embryo, or 
human fetus. It will be apparent that methods of the present invention will be of 
particular interest to individuals that have themselves been diagnosed with a disorder 
affecting the brain or have relatives that have been diagnosed with a disorder affecting 
the brain. 

By "screening for an increased risk" is meant determination of whether
a genetic variation exists in the human subject that correlates with a greater likelihood
of developing a disorder affecting the brain than exists for the human population as a
whole, or for a relevant racial or ethnic human sub-population to which the individual
belongs. Both positive and negative determinations (i.e., determinations that a genetic
predisposition marker is present or is absent) are intended to fall within the scope of
screening methods of the invention. In preferred embodiments, the presence of a
mutation altering the sequence or expression of at least one CON202 seven
transmembrane receptor allele in the nucleic acid is correlated with an increased risk
of developing schizophrenia, whereas the absence of such a mutation is reported as a
negative determination.

The "assaying" step of the invention may involve any techniques
available for analyzing nucleic acid to determine its characteristics, including but not
limited to well-known techniques such as single-strand conformation polymorphism
analysis (SSCP) [Orita et al., Proc. Natl. Acad. Sci. USA, 86: 2766-2770 (1989)];
heteroduplex analysis [White et al., Genomics, 12: 301-306 (1992)]; denaturing
gradient gel electrophoresis analysis [Fischer et al., Proc. Natl. Acad. Sci. USA, 80:
1579-1583 (1983); and Riesner et al., Electrophoresis, 10: 377-389 (1989)]; DNA
sequencing; RNase cleavage [Myers et al., Science, 230: 1242-1246 (1985)]; chemical
cleavage of mismatch techniques [Rowley et al., Genomics, 30: 574-582 (1995); and
Roberts et al., Nucl. Acids Res., 25: 3377-3378 (1997)]; restriction fragment length
polymorphism analysis; single nucleotide primer extension analysis [Shumaker et al.,
Hum. Mutat., 7: 346-354 (1996); and Pastinen et al., Genome Res., 7: 606-614
(1997)]; 5' nuclease assays [Pease et al., Proc. Natl. Acad. Sci. USA, 91: 5022-5026
(1994)]; DNA Microchip analysis [Ramsay, G., Nature Biotechnology, 16: 40-48
(1999); and Chee et al., U.S. Patent No. 5,837,832]; and ligase chain reaction
[Whiteley et al., U.S. Patent No. 5,521,065]. [See generally, Schafer and Hawkins,
Nature Biotechnology, 16: 33-39 (1998).] All of the foregoing documents are hereby
incorporated by reference in their entirety.

Thus, in one preferred embodiment involving screening CON202
sequences, for example, the assaying step comprises at least one procedure selected
from the group consisting of: (a) determining a nucleotide sequence of at least one
codon of at least one CON202 allele of the human subject; (b) performing a
hybridization assay to determine whether nucleic acid from the human subject has a
nucleotide sequence identical to or different from one or more reference sequences;
(c) performing a polynucleotide migration assay to determine whether nucleic acid
from the human subject has a nucleotide sequence identical to or different from one or
more reference sequences; and (d) performing a restriction endonuclease digestion to
determine whether nucleic acid from the human subject has a nucleotide sequence
identical to or different from one or more reference sequences.
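
The restriction endonuclease comparison of procedure (d) can be previewed in silico: a substitution that creates or destroys a recognition site changes the set of fragment lengths. The sketch below is illustrative only and rests on assumptions that are not from this disclosure: the molecule is linear, only one strand is scanned, and the example enzyme is EcoRI (recognition site GAATTC, cutting after the first G). The function names and both short sequences are hypothetical.

```python
def digest(seq, site, cut_offset):
    """Cut a linear sequence at every occurrence of a recognition site.

    cut_offset is the number of site nucleotides left on the upstream fragment
    (1 for EcoRI, which cuts G^AATTC).
    """
    seq, site = seq.upper(), site.upper()
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])  # remainder after the last cut
    return fragments


def fragment_lengths(seq, site, cut_offset):
    """Fragment lengths, i.e. what a sizing gel would resolve."""
    return [len(fragment) for fragment in digest(seq, site, cut_offset)]


reference = "AAAGAATTCTTTT"  # made-up reference containing one EcoRI site
subject = "AAAGAGTTCTTTT"    # made-up variant: A -> G destroys the site
print(fragment_lengths(reference, "GAATTC", 1))
print(fragment_lengths(subject, "GAATTC", 1))
```

Differing fragment-length lists indicate that the subject sequence differs from the reference at a recognition site; identical lists do not prove the sequences are identical elsewhere.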

In a highly preferred embodiment, the assaying involves sequencing of
nucleic acid to determine nucleotide sequence thereof, using any available sequencing
technique. [See, e.g., Sanger et al., Proc. Natl. Acad. Sci. (USA), 74: 5463-5467
(1977) (dideoxy chain termination method); Mirzabekov, TIBTECH, 12: 27-32 (1994)
(sequencing by hybridization); Drmanac et al., Nature Biotechnology, 16: 54-58
(1998); U.S. Patent No. 5,202,231; and Science, 260: 1649-1652 (1993) (sequencing
by hybridization); Kieleczawa et al., Science, 258: 1787-1791 (1992) (sequencing by
primer walking); Douglas et al., Biotechniques, 14: 824-828 (1993) (direct
sequencing of PCR products); and Akane et al., Biotechniques, 16: 238-241 (1994);
Maxam and Gilbert, Meth. Enzymol., 65: 499-560 (1977) (chemical termination
sequencing), all incorporated herein by reference.] The analysis may entail sequencing
of the entire seven transmembrane receptor gene genomic DNA sequence, or portions
thereof; or sequencing of the entire seven transmembrane receptor coding sequence or
portions thereof. In some circumstances, the analysis may involve a determination of
whether an individual possesses a particular allelic variant, in which case sequencing
of only a small portion of nucleic acid - enough to determine the sequence of a
particular codon characterizing the allelic variant - is sufficient. This approach is
appropriate, for example, when assaying to determine whether one family member
inherited the same allelic variant that has been previously characterized for another
family member, or, more generally, whether a person's genome contains an allelic
variant that has been previously characterized and correlated with a mental disorder
having a heritable component.

In another highly preferred embodiment, the assaying step comprises
performing a hybridization assay to determine whether nucleic acid from the human
subject has a nucleotide sequence identical to or different from one or more reference
sequences. In a preferred embodiment, the hybridization involves a determination of
whether nucleic acid derived from the human subject will hybridize with one or more
oligonucleotides, wherein the oligonucleotides have nucleotide sequences that
correspond identically to a portion of the GPCR gene sequence taught herein, such as
the CON202 coding sequence set forth in SEQ ID NO: 14, or that correspond
identically except for one mismatch. The hybridization conditions are selected to
differentiate between perfect sequence complementarity and imperfect matches
differing by one or more bases. Such hybridization experiments thereby can provide
single nucleotide polymorphism sequence information about the nucleic acid from the
human subject, by virtue of knowing the sequences of the oligonucleotides used in the
experiments.
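
The distinction between a perfect match and a single-mismatch match can be mimicked computationally by sliding an oligonucleotide along the subject sequence and counting mismatches per window. This is only a rough sketch: real hybridization depends on temperature, salt and probe length, and the probe pairs with the complementary strand, whereas the code compares same-sense strings for simplicity. The function names and sequences are hypothetical.

```python
def best_window(sample, oligo):
    """Return (1-based position, mismatch count) of the best-matching window."""
    sample, oligo = sample.upper(), oligo.upper()
    n = len(oligo)
    if n == 0 or n > len(sample):
        raise ValueError("oligo must be non-empty and no longer than the sample")
    scores = [
        (sum(a != b for a, b in zip(oligo, sample[i:i + n])), i + 1)
        for i in range(len(sample) - n + 1)
    ]
    mismatches, position = min(scores)  # fewest mismatches, then leftmost window
    return position, mismatches


def hybridization_call(sample, oligo):
    """Classify the best window as a perfect match, a single mismatch, or neither."""
    _, mismatches = best_window(sample, oligo)
    if mismatches == 0:
        return "perfect match"
    if mismatches == 1:
        return "single mismatch"
    return "no match"


oligo = "GACCTG"                                # made-up reference oligonucleotide
print(hybridization_call("TTGACCTGAA", oligo))  # subject carrying the reference allele
print(hybridization_call("TTGACTTGAA", oligo))  # subject with one substitution
```

Running the same subject sequence against a reference oligonucleotide and against an oligonucleotide carrying the known variant base indicates which allele is present, which is how knowledge of the oligonucleotide sequences yields single nucleotide polymorphism information.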

Several of the techniques outlined above involve an analysis wherein
one performs a polynucleotide migration assay, e.g., on a polyacrylamide
electrophoresis gel (or in a capillary electrophoresis system), under denaturing or
non-denaturing conditions. Nucleic acid derived from the human subject is subjected to
gel electrophoresis, usually adjacent to (or co-loaded with) one or more reference
nucleic acids, such as reference GPCR-encoding sequences having a coding sequence
identical to all or a portion of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 19 (or
identical except for one known polymorphism). The nucleic acid from the human
subject and the reference sequence(s) are subjected to similar chemical or enzymatic
treatments and then electrophoresed under conditions whereby the polynucleotides
will show a differential migration pattern, unless they contain identical sequences.
[See generally Ausubel et al. (eds.), Current Protocols in Molecular Biology, New
York: John Wiley & Sons, Inc. (1987-1999); and Sambrook et al. (eds.), Molecular
Cloning, A Laboratory Manual, Cold Spring Harbor, New York: Cold Spring Harbor
Laboratory Press (1989), both incorporated herein by reference in their entirety.]

In the context of assaying, the term "nucleic acid of a human subject"
is intended to include nucleic acid obtained directly from the human subject (e.g.,
DNA or RNA obtained from a biological sample such as a blood, tissue, or other cell
or fluid sample); and also nucleic acid derived from nucleic acid obtained directly
from the human subject. By way of non-limiting examples, well known procedures
exist for creating cDNA that is complementary to RNA derived from a biological
sample from a human subject, and for amplifying (e.g., via polymerase chain reaction
(PCR)) DNA or RNA derived from a biological sample obtained from a human
subject. Any such derived polynucleotide which retains relevant nucleotide sequence
information of the human subject's own DNA/RNA is intended to fall within the
definition of "nucleic acid of a human subject" for the purposes of the present
invention.

In the context of assaying, the term "mutation" includes addition,
deletion, and/or substitution of one or more nucleotides in the GPCR gene sequence
(e.g., as compared to the seven transmembrane receptor-encoding sequences set forth
in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 19) and other polymorphisms that occur
in introns (where introns exist) and that are identifiable via sequencing, restriction
fragment length polymorphism, or other techniques. The various activity examples
provided herein permit determination of whether a mutation modulates activity of the
relevant receptor in the presence or absence of various test substances.
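
The three mutation types named above (addition, deletion, substitution) can be told apart by trimming the prefix and suffix that the reference and subject sequences share and inspecting what remains. The sketch below is a simplification offered for illustration, with hypothetical names and made-up sequences: it assumes a single contiguous change, and in repetitive sequence the reported position is one of several equivalent placements.

```python
def classify_mutation(reference, subject):
    """Return (kind, 1-based position, reference segment, subject segment).

    kind is one of "none", "addition", "deletion", "substitution", or "complex".
    """
    ref, alt = reference.upper(), subject.upper()
    if ref == alt:
        return ("none", 0, "", "")

    # Length of the shared prefix.
    prefix = 0
    while prefix < len(ref) and prefix < len(alt) and ref[prefix] == alt[prefix]:
        prefix += 1

    # Length of the shared suffix, not overlapping the prefix.
    suffix = 0
    while (
        suffix < len(ref) - prefix
        and suffix < len(alt) - prefix
        and ref[-1 - suffix] == alt[-1 - suffix]
    ):
        suffix += 1

    ref_part = ref[prefix:len(ref) - suffix]
    alt_part = alt[prefix:len(alt) - suffix]

    if not ref_part:
        kind = "addition"
    elif not alt_part:
        kind = "deletion"
    elif len(ref_part) == len(alt_part):
        kind = "substitution"
    else:
        kind = "complex"
    return (kind, prefix + 1, ref_part, alt_part)


print(classify_mutation("ATGGCC", "ATGTCC"))  # one nucleotide replaced
print(classify_mutation("ATGGCC", "ATGCC"))   # one nucleotide missing
print(classify_mutation("ATGCC", "ATGGCC"))   # one nucleotide added
```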

In a related embodiment, the invention provides methods of screening
a person's genotype with respect to GPCR's of the invention, and correlating such
genotypes with diagnoses for disease or with predisposition for disease (for genetic
counseling). For example, the invention provides a method of screening for a
CON202 hereditary schizophrenia genotype in a human patient, comprising the steps
of: (a) providing a biological sample comprising nucleic acid from the patient, the
nucleic acid including sequences corresponding to said patient's CON202 alleles; (b)
analyzing the nucleic acid for the presence of a mutation or mutations; (c) determining
a CON202 genotype from the analyzing step; and (d) correlating the presence of a
mutation in a CON202 allele with a hereditary schizophrenia genotype. In a preferred
embodiment, the biological sample is a cell sample containing human cells that
contain genomic DNA of the human subject. The analyzing can be performed
analogously to the assaying described in preceding paragraphs. For example, the
analyzing comprises sequencing a portion of the nucleic acid (e.g., DNA or RNA), the
portion comprising at least one codon of the CON202 alleles.

Although more time consuming and expensive than methods involving
nucleic acid analysis, the invention also may be practiced by assaying protein of a
human subject to determine the presence or absence of an amino acid sequence
variation in GPCR protein from the human subject. Such protein analyses may be
performed, e.g., by fragmenting GPCR protein via chemical or enzymatic methods
and sequencing the resultant peptides; or by Western analyses using an antibody
having specificity for a particular allelic variant of the GPCR.

The invention also provides materials that are useful for performing
methods of the invention. For example, the present invention provides
oligonucleotides useful as probes in the many analyzing techniques described above.
In general, such oligonucleotide probes comprise 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39,
40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides that have a sequence that is
identical, or exactly complementary, to a portion of a human GPCR gene sequence
taught herein (or allelic variant thereof), or that is identical or exactly complementary
except for one nucleotide substitution. In a preferred embodiment, the
oligonucleotides have a sequence that corresponds in the foregoing manner to a
human GPCR coding sequence taught herein, and in particular, the coding sequences
set forth in SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 19. In one variation, an
oligonucleotide probe of the invention is purified and isolated. In another variation,
the oligonucleotide probe is labeled, e.g., with a radioisotope, chromophore, or
fluorophore. In yet another variation, the probe is covalently attached to a solid
support. [See generally Ausubel et al. and Sambrook et al., supra.]
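
The probe definition above (6 to 50 nucleotides, identical or exactly complementary to a portion of the gene sequence, optionally with one substitution) can be expressed as a simple check. The following is a hedged sketch with hypothetical function names: "exactly complementary" is modeled as the reverse complement of the probe occurring in the gene sequence, and the 30-nucleotide test sequence is for illustration only.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def min_mismatches(probe, target):
    """Fewest mismatches between probe and any equally long window of target."""
    n = len(probe)
    if n > len(target):
        return None
    return min(
        sum(a != b for a, b in zip(probe, target[i:i + n]))
        for i in range(len(target) - n + 1)
    )


def probe_matches(probe, gene, max_substitutions=1):
    """True if the probe has 6-50 nucleotides and matches the gene, or its
    complementary strand, with at most max_substitutions substitutions."""
    probe, gene = probe.upper(), gene.upper()
    if not 6 <= len(probe) <= 50:
        return False
    for candidate in (probe, reverse_complement(probe)):
        distance = min_mismatches(candidate, gene)
        if distance is not None and distance <= max_substitutions:
            return True
    return False


gene = "ATGTTTAGACCTCTTGTGAATCTCTCTCAC"  # illustrative 30-nucleotide stretch
print(probe_matches("TTTAGACC", gene))  # identical to a portion of the sequence
print(probe_matches("GGTCTAAA", gene))  # exactly complementary to that portion
print(probe_matches("TTTACACC", gene))  # identical except for one substitution
print(probe_matches("TTTAG", gene))     # shorter than 6 nucleotides
```

Setting `max_substitutions=0` restricts the check to probes that are identical or exactly complementary.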

In a related embodiment, the invention provides kits comprising
reagents that are useful for practicing methods of the invention. For example, the
invention provides a kit for screening a human subject to diagnose schizophrenia or a
genetic predisposition therefor, comprising, in association: (a) an oligonucleotide
useful as a probe for identifying polymorphisms in a human CON202 seven
transmembrane receptor gene, the oligonucleotide comprising 6-50 nucleotides that
have a sequence that is identical or exactly complementary to a portion of a human
CON202 gene sequence or CON202 coding sequence, except for one sequence
difference selected from the group consisting of a nucleotide addition, a nucleotide
deletion, or nucleotide substitution; and (b) a medium packaged with the
oligonucleotide containing information identifying polymorphisms identifiable with
the probe that correlate with schizophrenia or a genetic predisposition therefor.
Exemplary information-containing media include printed paper package inserts or
packaging labels; and magnetic and optical storage media that are readable by
computers or machines used by practitioners who perform genetic screening and
counseling services. The practitioner uses the information provided in the media to
correlate the results of the analysis with the oligonucleotide with a diagnosis. In a
preferred variation, the oligonucleotide is labeled.

In still another embodiment, the invention provides methods of
identifying those allelic variants of GPCR's of the invention that correlate with mental
disorders. For example, the invention provides a method of identifying a seven
transmembrane allelic variant that correlates with a mental disorder, comprising steps
of: (a) providing a biological sample comprising nucleic acid from a human patient
diagnosed with a mental disorder, or from the patient's genetic progenitors or
progeny; (b) analyzing the nucleic acid for the presence of a mutation or mutations in
at least one seven transmembrane receptor that is expressed in the brain, wherein the
at least one seven transmembrane receptor comprises an amino acid sequence selected
from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, and 20, or an
allelic variant thereof, and wherein the nucleic acid includes sequence corresponding
to the gene or genes encoding the at least one seven transmembrane receptor; (c)
determining a genotype for the patient for the at least one seven transmembrane
receptor from said analyzing step; and (d) identifying an allelic variant that correlates
with the mental disorder from the determining step. To expedite this process, it may
be desirable to perform linkage studies in the patients (and possibly their families) to
correlate chromosomal markers with disease states. The chromosomal localization
data provided herein facilitates identifying an involved GPCR with a chromosomal
marker.

The foregoing method can be performed to correlate GPCRs of the
invention to a number of disorders having hereditary components that are causative or
that predispose persons to the disorder. For example, in one preferred variation, the
disorder is schizophrenia, and the at least one seven transmembrane receptor
comprises CON202 having an amino acid sequence set forth in SEQ ID NO: 14, or an
allelic variant thereof.

Also contemplated as part of the invention are polynucleotides that
comprise the allelic variant sequences identified by such methods, and polypeptides
encoded by the allelic variant sequences, and oligonucleotide and oligopeptide
fragments thereof that embody the mutations that have been identified. Such materials
are useful in in vitro cell-free and cell-based assays for identifying lead compounds
and therapeutics for treatment of the disorders. For example, the variants are used in
activity assays, binding assays, and assays to screen for activity modulators described
herein. In one preferred embodiment, the invention provides a purified and isolated
polynucleotide comprising a nucleotide sequence encoding a CON202 receptor allelic
variant identified according to the methods described above; and an oligonucleotide
that comprises the sequences that differentiate the allelic variant from the CON202
sequences set forth in SEQ ID NOs: 13 and 14. The invention also provides a vector
comprising the polynucleotide (preferably an expression vector); and a host cell
transformed or transfected with the polynucleotide or vector. The invention also
provides an isolated cell line that is expressing the allelic variant GPCR polypeptide;
purified cell membranes from such cells; purified polypeptide; and synthetic peptides
that embody the allelic variation amino acid sequence. In one particular embodiment,
the invention provides a purified polynucleotide comprising a nucleotide sequence
encoding a CON202 seven transmembrane receptor protein of a human that is affected
with schizophrenia; wherein said polynucleotide hybridizes to the complement of
SEQ ID NO: 13 under the following hybridization conditions: (a) hybridization for 16
hours at 42°C in a hybridization solution comprising 50% formamide, 1% SDS, 1 M
NaCl, 10% dextran sulfate and (b) washing 2 times for 30 minutes at 60°C in a wash
solution comprising 0.1x SSC and 1% SDS; and wherein the polynucleotide encodes
a CON202 amino acid sequence that differs from SEQ ID NO: 14 at at least one
residue.

An exemplary assay for using the allelic variants is a method for
identifying a modulator of CON202 biological activity, comprising the steps of: (a)
contacting a cell expressing the allelic variant in the presence and in the absence of a
putative modulator compound; (b) measuring CON202 biological activity in the cell;
and (c) identifying a putative modulator compound in view of decreased or increased
CON202 biological activity in the presence versus absence of the putative modulator.
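
Step (c) of the assay above amounts to comparing two activity measurements. A minimal
sketch follows; the function name and the 1.5-fold cutoff are assumptions made for
illustration, since the text does not specify how large a change must be.

```python
def classify_modulator(activity_without, activity_with, min_fold_change=1.5):
    """Compare activity measured with and without a putative modulator.

    min_fold_change is a hypothetical cutoff; the source text only requires
    "decreased or increased" activity and names no threshold.
    """
    if activity_without <= 0 or activity_with < 0:
        raise ValueError("activities must be positive measurements")
    ratio = activity_with / activity_without
    if ratio >= min_fold_change:
        return "increases activity"
    if ratio <= 1.0 / min_fold_change:
        return "decreases activity"
    return "no call"


# Arbitrary example readings in arbitrary units.
print(classify_modulator(100.0, 250.0))
print(classify_modulator(100.0, 40.0))
print(classify_modulator(100.0, 110.0))
```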

In still another example, the invention provides for a method of
diagnosing schizophrenia or a susceptibility to schizophrenia comprising the steps of:
determining the presence or amount of expression of CON202 polypeptide as set out
as SEQ ID NO: 14 or the polypeptide encoded by the nucleic acid molecule having
SEQ ID NO: 13 in a sample; and comparing it to the level of CON202 polypeptide in a
biological, tissue or cellular sample from normal subjects or from the subject at an
earlier time, wherein the susceptibility to schizophrenia is based on the presence or
amount of CON202 polypeptide expression.

The invention also provides for a method of treating schizophrenia
comprising the step of administering to a human diagnosed with schizophrenia an
amount of a modulator of CON202 receptor activity sufficient to modulate CON202 
receptor activity or CON202 ligand binding in said human. 

The invention also provides assays to identify compounds that bind
GPCR seven transmembrane receptors. One such assay comprises the steps of: (a)
contacting a composition comprising one of the GPCR seven transmembrane receptor
polypeptides of the invention with a compound suspected of binding a GPCR
polypeptide of the invention; and (b) measuring binding between the compound and
the GPCR polypeptide. In one variation, the composition comprises a cell expressing
a GPCR polypeptide of the invention on its surface. In another variation, an isolated
GPCR polypeptide of the invention or cell membranes comprising a GPCR
polypeptide of the invention are employed. The binding may be measured directly,
e.g., using a labeled compound, or may be measured indirectly by several techniques,
including measuring intracellular signaling of a GPCR polypeptide of the invention
induced by the compound (or measuring changes in the level of GPCR polypeptide
signaling).

The invention also provides a method for identifying a modulator of
binding between a GPCR seven transmembrane receptor of the invention and a GPCR
polypeptide binding partner, comprising the steps of: (a) contacting a GPCR
polypeptide binding partner and a composition comprising one of the GPCR seven
transmembrane receptors of the invention in the presence and in the absence of a
putative modulator compound; (b) detecting binding between the binding partner and
the GPCR polypeptide of the invention; and (c) identifying a putative modulator
compound in view of decreased or increased binding between the binding partner and
the GPCR polypeptide in the presence of the putative modulator, as compared to
binding in the absence of the putative modulator.

GPCR polypeptide binding partners that stimulate GPCR seven
transmembrane receptors of the present invention are useful as agonists in disease
states characterized by insufficient GPCR polypeptide signaling (e.g., as a result of
insufficient expression of active GPCR polypeptide ligand). GPCR polypeptide
binding partners that block ligand-mediated GPCR polypeptide signaling are useful as
GPCR polypeptide antagonists to treat disease states characterized by excessive
GPCR polypeptide signaling.

Additional features and variations of the invention will be apparent to
those skilled in the art from the entirety of this application, including the detailed
description, and all such features are intended as aspects of the invention. Likewise,
features of the invention described herein can be re-combined into additional
embodiments that also are intended as aspects of the invention, irrespective of
whether the combination of features is specifically mentioned above as an aspect or
embodiment of the invention. Also, only such limitations which are described herein
as critical to the invention should be viewed as such; variations of the invention
lacking limitations which have not been described herein as critical are intended as
aspects of the invention.

In addition to the foregoing, the invention includes, as an additional
aspect, all embodiments of the invention narrower in scope in any way than the
variations specifically mentioned above. Although the applicant(s) invented the full
scope of the claims appended hereto, the claims appended hereto are not intended to
encompass within their scope the prior art work of others. Therefore, in the event that
statutory prior art within the scope of a claim is brought to the attention of the
applicants by a Patent Office or other entity or individual, the applicant(s) reserve the
right to exercise amendment rights under applicable patent laws to redefine the
subject matter of such a claim to specifically exclude such statutory prior art or
obvious variations of statutory prior art from the scope of such a claim. Variations of
the invention defined by such amended claims also are intended as aspects of the
invention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides purified and isolated polynucleotides
(e.g., DNA sequences and RNA transcripts, both sense and complementary antisense
strands, both single and double stranded, including splice variants thereof) encoding
human G protein-coupled receptors referred to herein as GPCR polypeptides. DNA
polynucleotides of the invention include genomic DNA, cDNA, and DNA that has
been chemically synthesized in whole or in part. "Synthesized" as used herein and
understood in the art, refers to polynucleotides produced by purely chemical, as
opposed to enzymatic, methods. "Wholly" synthesized DNA sequences are therefore
produced entirely by chemical means, and "partially" synthesized DNAs embrace
those wherein only portions of the resulting DNA were produced by chemical means.

Genomic DNA of the invention comprises the protein coding region
for a polypeptide of the invention and is also intended to include allelic variants
thereof. It is widely understood that, for many genes, genomic DNA is transcribed
into RNA transcripts that undergo one or more splicing events wherein introns (i.e.,
non-coding regions) of the transcripts are removed, or "spliced out." RNA transcripts
that can be spliced by alternative mechanisms, and therefore be subject to removal of
different RNA sequences but still encode a GPCR polypeptide of the present
invention, are referred to in the art as splice variants which are embraced by the
invention. Splice variants comprehended by the invention therefore are encoded by
the same original genomic DNA sequences but arise from distinct mRNA transcripts.
Allelic variants are modified forms of a wild type gene sequence, the modification
resulting from recombination during chromosomal segregation or exposure to
conditions which give rise to genetic mutation. Allelic variants, like wild type genes,
are naturally occurring sequences (as opposed to non-naturally occurring variants
which arise from in vitro manipulation).

The invention also comprehends cDNA that is obtained through
reverse transcription of an RNA polynucleotide encoding a GPCR of the present 
invention (conventionally followed by second strand synthesis of a complementary 
strand to provide a double-stranded DNA). 

A preferred DNA sequence encoding a human GPCR polypeptide is set
out in SEQ ID NO: 1, wherein nucleotides 157 to 1122 represent the CON193 coding
sequence, with termination codon (surrounded by upstream and downstream
untranslated sequences). Another preferred DNA sequence encoding a human GPCR
polypeptide is set out in SEQ ID NO: 3, wherein nucleotides 1 to 1014 represent the
CON166 coding sequence and stop codon. Still another preferred DNA sequence
encoding a human GPCR polypeptide is set out in SEQ ID NO: 5, wherein
nucleotides 691 to 1845 represent the CON103 coding sequence with stop codon
(surrounded by upstream and downstream untranslated sequences). Another
preferred DNA sequence encoding a human GPCR polypeptide is set out in SEQ ID
NO: 7, wherein nucleotides 146 to 1147 represent the CON203 coding sequence with
stop codon (surrounded by upstream and downstream untranslated sequences). A
preferred DNA sequence encoding a human GPCR polypeptide is set out in SEQ ID
NO: 9, wherein nucleotides 1 to 957 represent the CON198 coding sequence with stop
codon. Another preferred DNA sequence encoding a human GPCR polypeptide is set
out in SEQ ID NO: 11, wherein nucleotides 1 to 924 represent the CON197 coding
sequence with stop codon (followed by downstream untranslated sequences). A
preferred DNA sequence encoding a human GPCR polypeptide is set out in SEQ ID
NO: 13, wherein nucleotides 266 to 1378 represent the CON202 coding sequence and
termination codon (surrounded by upstream and downstream untranslated sequences).
A preferred DNA sequence encoding a human GPCR polypeptide is set out in SEQ ID
NO: 15, wherein nucleotides 1 to 1191 represent the CON222 coding sequence and
termination codon. A preferred DNA sequence encoding a human GPCR polypeptide
is set out in SEQ ID NO: 17, wherein nucleotides 13 to 1089 represent the CON215
coding sequence and termination codon (surrounded by upstream and downstream
untranslated sequences). A preferred DNA sequence encoding a human GPCR
polypeptide is set out in SEQ ID NO: 19, wherein nucleotides 42 to 1157 represent
the CON217 coding sequence (surrounded by upstream and downstream untranslated
sequences). The foregoing sequences without their termination codons also comprise
preferred sequences.
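
As a consistency check on the coordinates recited above, the following sketch computes
the length of each stated coding region and confirms that it is a whole number of
codons. The coordinates are copied from the text; the dictionary layout and function
names are illustrative assumptions, and no amino acid lengths are asserted because the
text does not state them here.

```python
# (SEQ ID NO, first nucleotide, last nucleotide) as recited in the text.
CODING_REGIONS = {
    "CON193": (1, 157, 1122),
    "CON166": (3, 1, 1014),
    "CON103": (5, 691, 1845),
    "CON203": (7, 146, 1147),
    "CON198": (9, 1, 957),
    "CON197": (11, 1, 924),
    "CON202": (13, 266, 1378),
    "CON222": (15, 1, 1191),
    "CON215": (17, 13, 1089),
    "CON217": (19, 42, 1157),
}


def region_length(first, last):
    """Length of an inclusive, 1-based nucleotide range."""
    return last - first + 1


def summarize(regions):
    """Map each clone name to (SEQ ID NO, length, codon count, in-frame flag)."""
    rows = {}
    for name, (seq_id, first, last) in regions.items():
        length = region_length(first, last)
        rows[name] = (seq_id, length, length // 3, length % 3 == 0)
    return rows


SUMMARY = summarize(CODING_REGIONS)
for name, (seq_id, length, codons, in_frame) in SUMMARY.items():
    print(f"{name} (SEQ ID NO: {seq_id}): {length} nt, {codons} codons, in frame: {in_frame}")
```

Every listed range is a multiple of three nucleotides, which is what one would expect
of an open reading frame.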

The worker of skill in the art will readily appreciate that the preferred
DNA of the invention comprises a double stranded molecule, for example the
molecule having any one of the sequences set forth in SEQ ID NOS: 1, 3, 5, 7, 9, 11,
13, 15, 17, or 19 (or coding portions thereof) along with the complementary molecule
(the "non-coding strand" or "complement") having a sequence deducible from the
sequence of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 19 according to Watson-Crick
base pairing rules for DNA. Also preferred are other polynucleotides encoding
the GPCR polypeptides of the invention set forth in SEQ ID NOS: 2, 4, 6, 8, 10, 12,
14, 16, 18 and 20 which differ in sequence from the polynucleotide of SEQ ID NOS:
1, 3, 5, 7, 9, 11, 13, 15, 17, or 19, respectively, by virtue of the well-known
degeneracy of the universal genetic code.
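
Both ideas in the paragraph above, deducing the complement by Watson-Crick pairing and
the degeneracy of the genetic code, can be shown with a short sketch. The nine-base
sequences are made up for illustration, and the codon table is only a small excerpt of
the standard genetic code.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Excerpt of the standard genetic code; "*" marks a termination codon.
CODON_TABLE = {
    "ATG": "M",
    "CTG": "L", "CTC": "L", "TTA": "L",
    "TAA": "*", "TGA": "*", "TAG": "*",
}


def reverse_complement(coding_strand):
    """Deduce the non-coding strand (5' to 3') by Watson-Crick base pairing."""
    return "".join(COMPLEMENT[b] for b in reversed(coding_strand.upper()))


def translate(dna):
    """Translate codon by codon until a termination codon is reached."""
    peptide = []
    usable = len(dna) - len(dna) % 3
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3].upper()]
        if amino_acid == "*":
            break
        peptide.append(amino_acid)
    return "".join(peptide)


# Two different made-up DNA sequences that encode the same dipeptide (Met-Leu).
first = "ATGCTGTAA"
second = "ATGTTATGA"
print(reverse_complement(first))
print(translate(first), translate(second))
```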

The invention further embraces species, preferably mammalian,
homologs of the human GPCR DNAs. Species homologs, sometimes referred to as
"orthologs," in general, share at least 35%, at least 40%, at least 45%, at least 50%, at
least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least
90%, at least 95%, at least 98%, or at least 99% homology with human DNA of the
invention. Percent sequence "homology" with respect to polynucleotides of the
invention is defined herein as the percentage of nucleotide bases in the candidate
sequence that are identical to nucleotides in the GPCR sequence set forth in any one
of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 19 after aligning the sequences and
introducing gaps, if necessary, to achieve the maximum percent sequence identity.
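
The percentage defined above can be computed once two sequences have been aligned. The
sketch below assumes the alignment has already been produced by some external program
and is supplied as two equal-length strings with "-" for gaps; the function name and
the example sequences are hypothetical.

```python
def percent_homology(aligned_candidate, aligned_reference):
    """Percentage of candidate bases identical to the aligned reference base.

    Gap positions in the candidate are not counted as candidate bases.
    """
    if len(aligned_candidate) != len(aligned_reference):
        raise ValueError("aligned sequences must have equal length")
    pairs = list(zip(aligned_candidate.upper(), aligned_reference.upper()))
    candidate_bases = sum(1 for c, _ in pairs if c != "-")
    if candidate_bases == 0:
        raise ValueError("candidate contains no bases")
    identical = sum(1 for c, r in pairs if c != "-" and c == r)
    return 100.0 * identical / candidate_bases


# Made-up alignment: one gap in the candidate and one mismatch.
print(percent_homology("ACGT-ACGT", "ACGTTACCT"))
```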

The polynucleotide sequence information provided by the invention
makes possible large scale expression of the encoded polypeptide by techniques well
known and routinely practiced in the art. Polynucleotides of the invention also permit
identification and isolation of polynucleotides encoding related GPCR polypeptides,
such as human allelic variants and species homologs, by well known techniques
including Southern and/or Northern hybridization, and polymerase chain reaction
(PCR). Examples of related polynucleotides include human and non-human genomic
sequences, including allelic variants, as well as polynucleotides encoding polypeptides
homologous to GPCR polypeptides and structurally related polypeptides sharing
one or more biological, immunological, and/or physical properties of the GPCR
polypeptides. Non-human species genes encoding proteins homologous to GPCR
polypeptides can also be identified by Southern and/or PCR analysis and are useful in
animal models for GPCR-related disorders. Knowledge of the sequence of a human
GPCR DNA also makes possible, through use of Southern hybridization or
polymerase chain reaction (PCR), the identification of genomic DNA sequences
encoding GPCR expression control regulatory sequences such as promoters,
operators, enhancers, repressors, and the like. Polynucleotides of the invention are
also useful in hybridization assays to detect the capacity of cells to express GPCR
polypeptides. Polynucleotides of the invention may also be the basis for diagnostic
methods useful for identifying a genetic alteration(s) in a GPCR locus that underlies a
disease state or states, which information is useful both for diagnosis and for selection
of therapeutic strategies.

The disclosure herein of full length polynucleotides encoding GPCR
polypeptides of the present invention makes readily available to the worker of
ordinary skill in the art every possible fragment of the full length polynucleotides.
The invention therefore provides fragments of GPCR-encoding polynucleotides
comprising at least 14-15, and preferably at least 18, 20, 25, 50, or 75 consecutive
nucleotides of a polynucleotide encoding GPCR polypeptides. Preferably, fragment
polynucleotides of the invention comprise sequences unique to the GPCR-encoding
polynucleotide sequence, and therefore hybridize under highly stringent or moderately
stringent conditions only (i.e., "specifically") to polynucleotides encoding GPCR
polypeptides (or fragments thereof). Polynucleotide fragments of genomic sequences
of the invention comprise not only sequences unique to the coding region, but also
include fragments of the full length sequence derived from introns, regulatory regions,
and/or other non-translated sequences. Sequences unique to polynucleotides of the
invention are recognizable through sequence comparison to other known
polynucleotides, and can be identified through use of alignment programs routinely
utilized in the art, e.g., those made available in public sequence databases. Such
sequences also are recognizable from Southern and Northern hybridization analyses to
determine the number of fragments of genomic DNA and RNA to which a
polynucleotide will hybridize. Polynucleotides of the invention can be labeled in a
manner that permits their detection, including radioactive, fluorescent, and enzymatic
labeling.

Fragment polynucleotides are particularly useful as probes for
detection of full length or other fragment GPCR polynucleotides. One or more
fragment polynucleotides can be included in kits that are used to detect the presence
of a polynucleotide encoding a GPCR polypeptide, or used to detect variations in
polynucleotide sequences encoding GPCR polypeptides.

The invention also embraces DNAs encoding GPCR polypeptides
which DNAs hybridize under moderately stringent or high stringency conditions to 
the non-coding strand, or complement, of the polynucleotide in any one of SEQ ID 
NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17 or 19. 

Exemplary highly stringent hybridization conditions are as follows:
hybridization at 42°C in a hybridization solution comprising 50% formamide, 1%
SDS, 1 M NaCl, 10% Dextran sulfate, and washing twice for 30 minutes at 60°C in a
wash solution comprising 0.1x SSC and 1% SDS. It is understood in the art that
conditions of equivalent stringency can be achieved through variation of temperature
and buffer, or salt concentration as described in Ausubel, et al. (Eds.), Protocols in
Molecular Biology, John Wiley & Sons (1994), pp. 6.0.3 to 6.4.10. Modifications in
hybridization conditions can be empirically determined or precisely calculated based
on the length and the percentage of guanosine/cytosine (GC) base pairing of the
probe. The hybridization conditions can be calculated as described in Sambrook et
al., (Eds.), Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory
Press: Cold Spring Harbor, New York (1989), pp. 9.47 to 9.51.
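
To illustrate how probe length and GC content enter such a calculation, the sketch
below uses a widely quoted empirical approximation for the melting temperature of long
DNA duplexes, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%GC) - 0.63(% formamide) - 600/length.
That this is the calculation intended by the cited pages is an assumption, and the
600-nucleotide, 50% GC probe is a made-up example.

```python
import math


def gc_percent(probe):
    """Percentage of G and C bases in a probe sequence."""
    probe = probe.upper()
    return 100.0 * (probe.count("G") + probe.count("C")) / len(probe)


def estimated_tm(length, gc_pct, sodium_molar, formamide_pct=0.0):
    """Approximate duplex melting temperature in degrees C (empirical formula).

    The formula is an assumption of this sketch, not a quotation from the source.
    """
    return (81.5
            + 16.6 * math.log10(sodium_molar)
            + 0.41 * gc_pct
            - 0.63 * formamide_pct
            - 600.0 / length)


# Hypothetical probe in the hybridization solution described above
# (1 M NaCl, 50% formamide).
tm = estimated_tm(length=600, gc_pct=50.0, sodium_molar=1.0, formamide_pct=50.0)
print(f"Estimated Tm: {tm:.1f} C")
```

Under these assumptions, raising the formamide percentage or lowering the salt
concentration lowers the estimated Tm, which is how conditions of equivalent stringency
can be reached at different temperatures.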

Autonomously replicating recombinant expression constructs such as
plasmid and viral DNA vectors incorporating polynucleotides of the invention are also
provided. Expression constructs wherein GPCR-encoding polynucleotides are
operatively linked to an endogenous or exogenous expression control DNA sequence
and a transcription terminator are also provided. Expression control DNA sequences
include promoters, enhancers, and operators, and are generally selected based on the
expression systems in which the expression construct is to be utilized. Preferred
promoter and enhancer sequences are generally selected for the ability to increase
gene expression, while operator sequences are generally selected for the ability to
regulate gene expression. Expression constructs of the invention may also include
sequences encoding one or more selectable markers that permit identification of host
cells bearing the construct. Expression constructs may also include sequences that
facilitate, and preferably promote, homologous recombination in a host cell. Preferred
constructs of the invention also include sequences necessary for replication in a host
cell.

Expression constructs are preferably utilized for production of an 
encoded protein, but also may be utilized simply to amplify GPCR-encoding
polynucleotide sequences. 

According to another aspect of the invention, host cells are provided,
including prokaryotic and eukaryotic cells, comprising a polynucleotide of the
invention (or vector of the invention) in a manner which permits expression of the
encoded GPCR polypeptide. Polynucleotides of the invention may be introduced into
the host cell as part of a circular plasmid, or as linear DNA comprising an isolated
protein coding region or a viral vector. Methods for introducing DNA into the host
cell well known and routinely practiced in the art include transformation, transfection,
electroporation, nuclear injection, or fusion with carriers such as liposomes, micelles,
ghost cells, and protoplasts. Expression systems of the invention include bacterial,
yeast, fungal, plant, insect, invertebrate, and mammalian cell systems.

Host cells of the invention are a valuable source of immunogen for
development of antibodies specifically immunoreactive with GPCR polypeptides.
Host cells of the invention are also useful in methods for large scale production of
GPCR polypeptides wherein the cells are grown in a suitable culture medium and the
desired polypeptide products are isolated from the cells or from the medium in which
the cells are grown by purification methods known in the art, e.g., conventional
chromatographic methods including immunoaffinity chromatography, receptor
affinity chromatography, hydrophobic interaction chromatography, lectin affinity
chromatography, size exclusion filtration, cation or anion exchange chromatography,
high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still
other methods of purification include those wherein the desired protein is expressed
and purified as a fusion protein having a specific tag, label, or chelating moiety that is
recognized by a specific binding partner or agent. The purified protein can be cleaved
to yield the desired protein, or be left as an intact fusion protein. Cleavage of the
fusion component may produce a form of the desired protein having additional amino
acid residues as a result of the cleavage process.

Knowledge of GPCR DNA sequences allows for modification of cells
to permit, or increase, expression of endogenous GPCR. Cells can be modified (e.g.,
by homologous recombination) to provide increased expression by replacing, in whole
or in part, the naturally occurring GPCR promoter with all or part of a heterologous
promoter so that the cells express GPCR polypeptides at higher levels. The
heterologous promoter is inserted in such a manner that it is operatively linked to
endogenous GPCR polypeptide encoding sequences. [See, for example, PCT
International Publication No. WO 94/12650, PCT International Publication No. WO
92/20808, and PCT International Publication No. WO 91/09955.] It is also
contemplated that, in addition to heterologous promoter DNA, amplifiable marker
DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl
phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron
DNA may be inserted along with the heterologous promoter DNA. If linked to the
GPCR coding sequence, amplification of the marker DNA by standard selection
methods results in co-amplification of the GPCR coding sequences in the cells.

The DNA sequence information provided by the present invention also
makes possible the development through, e.g., homologous recombination or
"knock-out" strategies [Capecchi, Science 244: 1288-1292 (1989)], of animals that
fail to express functional GPCR polypeptides or that express a variant of GPCR
polypeptides. Such animals (especially small laboratory animals such as rats, rabbits,
and mice) are useful as models for studying the in vivo activities of GPCR
polypeptides and modulators of GPCR polypeptides.

Also made available by the invention are anti-sense polynucleotides
which recognize and hybridize to polynucleotides encoding GPCR polypeptides. Full
length and fragment anti-sense polynucleotides are provided. Fragment anti-sense
molecules of the invention include those which specifically recognize and hybridize to
GPCR RNA (as determined by sequence comparison of DNA encoding GPCR
polypeptides to DNA encoding other known molecules). Identification of sequences
unique to GPCR-encoding polynucleotides can be deduced through use of any
publicly available sequence database, and/or through use of commercially available
sequence comparison programs. The uniqueness of selected sequences in an entire
genome can be further verified by hybridization analyses. After identification of the
desired sequences, isolation through restriction digestion or amplification using any of
the various polymerase chain reaction techniques well known in the art can be
performed. Antisense polynucleotides are particularly relevant to regulating
expression of GPCR polypeptides by those cells expressing GPCR mRNA.

Antisense nucleic acids (preferably 10 to 20 base pair oligonucleotides)
capable of specifically binding to GPCR expression control sequences or GPCR RNA
are introduced into cells (e.g., by a viral vector or colloidal dispersion system such as
a liposome). The antisense nucleic acid binds to the GPCR target nucleotide sequence
in the cell and prevents transcription or translation of the target sequence.
Phosphorothioate and methylphosphonate antisense oligonucleotides are specifically
contemplated for therapeutic use by the invention. The antisense oligonucleotides
may be further modified by poly-L-lysine, transferrin polylysine, or cholesterol
moieties at their 5' end. Suppression of GPCR polypeptide expression at either the
transcriptional or translational level is useful to generate cellular and/or animal models
for diseases characterized by aberrant GPCR polypeptide expression.

The GPCR polynucleotide and polypeptide sequences taught in the
present invention facilitate the design of novel transcription factors for modulating
GPCR polypeptide expression in native cells and animals, and cells transformed or
transfected with GPCR polynucleotides. For example, the Cys2-His2 zinc finger
proteins, which bind DNA via their zinc finger domains, have been shown to be
amenable to structural changes that lead to the recognition of different target
sequences. These artificial zinc finger proteins recognize specific target sites with
high affinity and low dissociation constants, and are able to act as gene switches to
modulate gene expression. Knowledge of the particular GPCR target sequence of the
present invention facilitates the engineering of zinc finger proteins specific for the
target sequence using known methods such as a combination of structure-based
modeling and screening of phage display libraries [Segal et al., Proc Natl Acad Sci
USA 96: 2758-2763 (1999); Liu et al., Proc Natl Acad Sci USA 94: 5525-30 (1997);
Greisman and Pabo, Science 275: 657-61 (1997); Choo et al., J Mol Biol 273: 525-32
(1997)]. Each zinc finger domain usually recognizes three or more base pairs. Since
a recognition sequence of 18 base pairs is generally sufficient in length to render it
unique in any known genome, a zinc finger protein consisting of 6 tandem repeats of
zinc fingers would be expected to ensure specificity for a particular sequence [Segal et
al., Proc Natl Acad Sci USA 96: 2758-2763 (1999)]. The artificial zinc finger repeats,
designed based on GPCR polynucleotide sequences, are fused to activation or
repression domains to promote or suppress GPCR polypeptide expression [Liu et al.,
Proc Natl Acad Sci USA 94: 5525-30 (1997)]. Alternatively, the zinc finger domains
can be fused to the TATA box-binding factor (TBP) with varying lengths of linker
region between the zinc finger peptide and the TBP to create either transcriptional
activators or repressors [Kim et al., Proc Natl Acad Sci USA 94: 3616-3620 (1997)].
Such proteins, and polynucleotides that encode them, have utility for modulating
GPCR polypeptide expression in vivo in both native cells, animals and humans;
and/or cells transfected with GPCR polynucleotide-encoding sequences. The novel
transcription factor can be delivered to the target cells by transfecting constructs that
express the transcription factor (gene therapy), or by introducing the protein.
Engineered zinc finger proteins can also be designed to bind RNA sequences for use
in therapeutics as alternatives to antisense or catalytic RNA methods [McColl et al.,
Proc Natl Acad Sci USA 96:9521-6 (1999); Wu et al., Proc Natl Acad Sci USA
92:344-348 (1995)]. The present invention contemplates methods of designing such
transcription factors based on the gene sequence of the invention, as well as
customized zinc finger proteins, that are useful to modulate GPCR polypeptide
expression in cells (native or transformed) whose genetic complement includes these
sequences.
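
The arithmetic behind the statement that six zinc fingers recognizing three base pairs
each give an 18 base pair site that is expected to be unique can be sketched as
follows. The genome size of 3 x 10^9 base pairs is an assumed round figure for a
human-sized genome, and the estimate treats the genome as a random sequence, which is a
simplification.

```python
ASSUMED_GENOME_BP = 3_000_000_000  # assumed round figure, not from the source


def site_length(fingers, bp_per_finger=3):
    """Total recognition site length for a tandem array of zinc fingers."""
    return fingers * bp_per_finger


def possible_sites(length):
    """Number of distinct DNA sequences of the given length."""
    return 4 ** length


def expected_random_matches(length, genome_bp):
    """Expected chance occurrences of one specific site, counting both strands."""
    return 2 * genome_bp / possible_sites(length)


length = site_length(fingers=6)
print(length, possible_sites(length))
print(expected_random_matches(length, ASSUMED_GENOME_BP))
```

Under these assumptions the expected number of chance matches to a given 18 base pair
site is well below one, while a 9 base pair site (three fingers) would be expected to
occur by chance many thousands of times.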

The invention also provides purified and isolated mammalian GPCR
polypeptides encoded by a polynucleotide of the invention. Presently preferred is a
human GPCR polypeptide comprising the amino acid sequence set out in any one of
SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18 or 20.

The invention also embraces polypeptides that have at least 99%, at
least 95%, at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least
65%, at least 60%, at least 55% or at least 50% identity and/or homology to a
preferred polypeptide of the invention. Percent amino acid sequence "identity" with
respect to the preferred polypeptide of the invention is defined herein as the
percentage of amino acid residues in the candidate sequence that are identical with the
residues in a GPCR polypeptide sequence after aligning both sequences and
introducing gaps, if necessary, to achieve the maximum percent sequence identity, and
not considering any conservative substitutions as part of the sequence identity.
Percent sequence "homology" with respect to the preferred polypeptide of the
invention is defined herein as the percentage of amino acid residues in the candidate
sequence that are identical with the residues in a GPCR sequence after aligning the
sequences and introducing gaps, if necessary, to achieve the maximum percent
sequence identity, and also considering any conservative substitutions as part of the
sequence identity.
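
As a rough illustration of the two definitions above, the following Python sketch
computes percent identity and percent homology for a pair of sequences that have
already been aligned (gaps written as "-"). The function names, the toy sequences,
and the choice of denominator (the residues of the candidate sequence) are
assumptions made for this example. The conservative groups are borrowed from
Table A below for convenience; the specification does not prescribe any particular
implementation.

```python
# Conservative groups as listed in Table A (used here only as an assumption).
CONSERVATIVE_GROUPS = ["GAPILV", "CSTMNQ", "DEKR", "HFWY", "NQDE"]


def is_conservative(a, b):
    """True if a and b are different residues that share a Table A group."""
    return a != b and any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def percent_identity_and_homology(candidate, reference):
    """Return (percent identity, percent homology) for two aligned sequences.

    Both strings must have the same length, with '-' marking alignment gaps.
    Percentages are taken over the residues of the candidate sequence.
    """
    if len(candidate) != len(reference):
        raise ValueError("aligned sequences must have equal length")
    residues = identical = conservative = 0
    for c, r in zip(candidate, reference):
        if c == "-":
            continue  # a gap in the candidate is not a candidate residue
        residues += 1
        if c == r:
            identical += 1
        elif r != "-" and is_conservative(c, r):
            conservative += 1
    if residues == 0:
        raise ValueError("candidate sequence contains no residues")
    identity = 100.0 * identical / residues
    # Homology counts identical residues plus conservative substitutions.
    homology = 100.0 * (identical + conservative) / residues
    return identity, homology


if __name__ == "__main__":
    # Hypothetical toy alignment, not taken from any SEQ ID NO.
    identity, homology = percent_identity_and_homology("MKT-LLV", "MRTALIV")
    print(f"identity {identity:.1f}%, homology {homology:.1f}%")
```

In the toy alignment, four of the six candidate residues are identical and the
remaining two (K/R and L/I) are conservative substitutions, so homology exceeds
identity. The alignment step itself (gap placement to maximize identity) is outside
the scope of this sketch.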

In one aspect, percent homology is calculated as the percentage of
amino acid residues in the smaller of two sequences which align with identical amino
acid residues in the sequence being compared, when four gaps in a length of 100 amino
acids may be introduced to maximize alignment [Dayhoff, in Atlas of Protein
Sequence and Structure, Vol. 5, p. 124, National Biochemical Research Foundation,
Washington, D.C. (1972), incorporated herein by reference].

Polypeptides of the invention may be isolated from natural cell
sources or may be chemically synthesized, but are preferably produced by
recombinant procedures involving host cells of the invention. Use of mammalian host
cells is expected to provide for such post-translational modifications (e.g.,
glycosylation, truncation, lipidation, and phosphorylation) as may be needed to confer
optimal biological activity on recombinant expression products of the invention.
Glycosylated and non-glycosylated forms of GPCR polypeptides are embraced. 

The invention also embraces variant (or analog) GPCR polypeptides. 
In one example, insertion variants are provided wherein one or more amino acid 
residues supplement a GPCR amino acid sequence. Insertions may be located at 
either or both termini of the protein, or may be positioned within internal regions of
the GPCR amino acid sequence. Insertional variants with additional residues at either
or both termini can include, for example, fusion proteins and proteins including amino
acid tags or labels.

Insertion variants include GPCR polypeptides wherein one or more
amino acid residues are added to a GPCR amino acid sequence, or to a biologically 
active fragment thereof. 

Variant products of the invention also include mature GPCR
polypeptide products, i.e., GPCR polypeptide products wherein leader or signal
sequences are removed, with additional amino terminal residues. The additional
amino terminal residues may be derived from another protein, or may include one or
more residues that are not identifiable as being derived from a specific protein.
GPCR polypeptide products with an additional methionine residue at position -1
(Met⁻¹-GPCR) are contemplated, as are variants with additional methionine and lysine
residues at positions -2 and -1 (Met⁻²-Lys⁻¹-GPCR). Variants of GPCR polypeptide
with additional Met, Met-Lys, or Lys residues (or one or more basic residues in general)
are particularly useful for enhanced recombinant protein production in bacterial host
cells.

The invention also embraces GPCR polypeptide variants having
additional amino acid residues which result from use of specific expression systems.
For example, use of commercially available vectors that express a desired polypeptide
as part of a glutathione-S-transferase (GST) fusion product provides the desired
polypeptide having an additional glycine residue at position -1 after cleavage of the
GST component from the desired polypeptide. Variants which result from expression
in other vector systems are also contemplated.

Insertional variants also include fusion proteins wherein the amino
and/or carboxy terminus of a GPCR polypeptide is fused to another polypeptide.

In another aspect, the invention provides deletion variants wherein one
or more amino acid residues in a GPCR polypeptide are removed. Deletions can be
effected at one or both termini of the GPCR polypeptide, or with removal of one or
more residues within the GPCR amino acid sequence. Deletion variants, therefore,
include all fragments of a GPCR polypeptide.

The invention also embraces polypeptide fragments of the sequence set
out in SEQ ID NO: 2 wherein the fragments maintain biological (e.g., ligand binding
and/or intracellular signaling) or immunological properties of a GPCR polypeptide.
Fragments comprising at least 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino acids
of SEQ ID NO: 2 are comprehended by the invention. Preferred polypeptide
fragments display antigenic properties unique to or specific for human GPCR and its
allelic and species homologs. Fragments of the invention having the desired
biological and immunological properties can be prepared by any of the methods well
known and routinely practiced in the art.
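
To make the notion of "consecutive amino acids" concrete, the short Python sketch
below enumerates every fragment of a given length from a sequence using a sliding
window. The function name and the ten-residue toy sequence are hypothetical; the
toy sequence is not SEQ ID NO: 2, and the sketch says nothing about which fragments
retain biological or immunological activity.

```python
def consecutive_fragments(sequence, length):
    """Return every fragment of `length` consecutive residues, in order."""
    if length < 1:
        raise ValueError("fragment length must be at least 1")
    # A sequence of n residues has n - length + 1 windows (none if too short).
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


if __name__ == "__main__":
    toy = "MKTLLVAGHS"  # hypothetical sequence for illustration only
    for fragment in consecutive_fragments(toy, 5):
        print(fragment)
```

A sequence shorter than the requested length simply yields no fragments.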

In still another aspect, the invention provides substitution variants of
GPCR polypeptides. Substitution variants include those polypeptides wherein one or
more amino acid residues of a GPCR polypeptide are removed and replaced with
alternative residues. In one aspect, the substitutions are conservative in nature;
however, the invention also embraces substitutions that are non-conservative.
Conservative substitutions for this purpose may be defined as set out in Tables A, B,
or C below.

Variant polypeptides include those wherein conservative substitutions
have been introduced by modification of polynucleotides encoding polypeptides of the
invention. Amino acids can be classified according to physical properties and
contribution to secondary and tertiary protein structure. A conservative substitution is
recognized in the art as a substitution of one amino acid for another amino acid that
has similar properties. Exemplary conservative substitutions are set out in Table A
(from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02197, filed
9/6/96)), immediately below.



Table A
Conservative Substitutions I

SIDE CHAIN CHARACTERISTIC        AMINO ACID

Aliphatic
    Non-polar                    G A P I L V
    Polar - uncharged            C S T M N Q
    Polar - charged              D E K R
Aromatic                         H F W Y
Other                            N Q D E



Alternatively, conservative amino acids can be grouped as described in Lehninger
[Biochemistry, Second Edition; Worth Publishers, Inc. NY:NY (1975), pp. 71-77] as
set out in Table B, immediately below.

Table B
Conservative Substitutions II

SIDE CHAIN CHARACTERISTIC        AMINO ACID

Non-polar (hydrophobic)
    A. Aliphatic:                A L I V P
    B. Aromatic:                 F W
    C. Sulfur-containing:        M
    D. Borderline:               G
Uncharged-polar
    A. Hydroxyl:                 S T Y
    B. Amides:                   N Q
    C. Sulfhydryl:               C
    D. Borderline:               G
Positively Charged (Basic):      K R H
Negatively Charged (Acidic):     D E



As still another alternative, exemplary conservative substitutions are set out in
Table C, immediately below.

Table C
Conservative Substitutions III

Original Residue     Exemplary Substitution

Ala (A)              Val, Leu, Ile
Arg (R)              Lys, Gln, Asn
Asn (N)              Gln, His, Lys, Arg
Asp (D)              Glu
Cys (C)              Ser
Gln (Q)              Asn
Glu (E)              Asp
His (H)              Asn, Gln, Lys, Arg
Ile (I)              Leu, Val, Met, Ala, Phe
Leu (L)              Ile, Val, Met, Ala, Phe
Lys (K)              Arg, Gln, Asn
Met (M)              Leu, Phe, Ile
Phe (F)              Leu, Val, Ile, Ala
Pro (P)              Gly
Ser (S)              Thr
Thr (T)              Ser
Trp (W)              Tyr
Tyr (Y)              Trp, Phe, Thr, Ser
Val (V)              Ile, Leu, Met, Phe, Ala
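
One way to apply Table C in practice is as a lookup from each original residue to
its exemplary substitutions. The Python sketch below is a minimal illustration under
stated assumptions: the dictionary transcribes Table C using one-letter codes (with
the garbled residue names in the scanned table read as Ile, Gln, Leu and Trp), the
function names are hypothetical, and the two sequences are toy examples of equal
length rather than sequences of the invention.

```python
# Table C transcribed with one-letter codes: original residue -> substitutions.
# Reading of garbled entries in the scan (e.g. "Tie" as Ile) is an assumption.
EXEMPLARY_SUBSTITUTIONS = {
    original: set(substitutes)
    for original, substitutes in {
        "A": "VLI", "R": "KQN", "N": "QHKR", "D": "E", "C": "S",
        "Q": "N", "E": "D", "H": "NQKR", "I": "LVMAF", "L": "IVMAF",
        "K": "RQN", "M": "LFI", "F": "LVIA", "P": "G", "S": "T",
        "T": "S", "W": "Y", "Y": "WFTS", "V": "ILMFA",
    }.items()
}


def is_exemplary_substitution(original, replacement):
    """True if Table C lists `replacement` as a substitution for `original`."""
    return replacement in EXEMPLARY_SUBSTITUTIONS.get(original, set())


def classify_substitutions(native, variant):
    """List (position, original, replacement, listed_in_table_c) per change.

    Positions are 1-based. The two sequences must have the same length,
    i.e. the variant differs from the native sequence only by substitution.
    """
    if len(native) != len(variant):
        raise ValueError("substitution variants must match the native length")
    return [
        (pos, n, v, is_exemplary_substitution(n, v))
        for pos, (n, v) in enumerate(zip(native, variant), start=1)
        if n != v
    ]


if __name__ == "__main__":
    for change in classify_substitutions("MALWK", "MVLYE"):
        print(change)
```

Note that the lookup is directional, as the table is: Pro to Gly is listed, but the
table has no row for Gly as an original residue, so Gly to Pro is reported as not
listed.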



GPCR polypeptide variants that display ligand binding properties of
native GPCR polypeptides and are expressed at higher levels, and variants that
provide for constitutively active receptors, are particularly useful in assays of the
invention. Such variants also are useful in cellular and animal models for diseases
characterized by aberrant GPCR polypeptide expression/activity.

It should be understood that the definition of polypeptides of the
invention is intended to include polypeptides bearing modifications other than
insertion, deletion, or substitution of amino acid residues. By way of example, the
modifications may be covalent in nature, and include, for example, chemical bonding
with polymers, lipids, and other organic and inorganic moieties. Such derivatives may be
prepared to increase circulating half-life of a polypeptide, or may be designed to
improve targeting capacity for the polypeptide to desired cells, tissues, or organs.

Similarly, the invention further embraces GPCR polypeptides that have
been covalently modified to include one or more water soluble polymer attachments
such as polyethylene glycol, polyoxyethylene glycol, or polypropylene glycol.

In a related embodiment, the present invention provides compositions
comprising purified polypeptides of the invention. Preferred compositions comprise,
in addition to the polypeptide of the invention, a pharmaceutically acceptable (i.e.,
sterile and non-toxic) liquid, semisolid, or solid diluent that serves as a pharmaceutical
vehicle, excipient, or medium. Any diluent known in the art may be used. Exemplary
diluents include, but are not limited to, water, saline solutions, polyoxyethylene
sorbitan monolaurate, magnesium stearate, methyl- and propylhydroxybenzoate, talc,
alginates, starches, lactose, sucrose, dextrose, sorbitol, mannitol, glycerol, calcium
phosphate, mineral oil, and cocoa butter.

Also comprehended by the present invention are antibodies (e.g.,
monoclonal and polyclonal antibodies, single chain antibodies, chimeric antibodies,
bifunctional/bispecific antibodies, humanized antibodies, human antibodies, and
complementarity determining region (CDR)-grafted antibodies, including compounds
which include CDR sequences which specifically recognize a polypeptide of the
invention) specific for GPCR polypeptides of the invention or fragments thereof.
Preferred antibodies of the invention are human antibodies which can be produced
and identified according to methods described in WO 93/11236, published June 20,
1993, which is incorporated herein by reference in its entirety. Antibody fragments,
including Fab, Fab', F(ab')2, and Fv, are also provided by the invention. The term
"specific for," when used to describe antibodies of the invention, indicates that the
variable regions of the antibodies of the invention recognize and bind GPCR
polypeptides exclusively (i.e., are able to distinguish GPCR polypeptides from other
known GPCR polypeptides by virtue of measurable differences in binding affinity,
despite the possible existence of localized sequence identity, homology, or similarity
between GPCR polypeptides and such polypeptides). It will be understood that
specific antibodies may also interact with other proteins (for example, S. aureus
protein A or other antibodies in ELISA techniques) through interactions with
sequences outside the variable region of the antibodies, and in particular, in the
constant region of the molecule. Screening assays to determine binding specificity of
an antibody of the invention are well known and routinely practiced in the art. For a
comprehensive discussion of such assays, see Harlow et al. (Eds), Antibodies: A
Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring Harbor, NY
(1988), Chapter 6. Antibodies that recognize and bind fragments of the GPCR
polypeptides of the invention are also contemplated, provided that the antibodies are,
first and foremost, specific for GPCR polypeptides. Antibodies of the invention can
be produced using any method well known and routinely practiced in the art.

Non-human antibodies may be humanized by any methods known in
the art. In one method, the non-human CDRs are inserted into a human antibody or
consensus antibody framework sequence. Further changes can then be introduced into
the antibody framework to modulate affinity or immunogenicity.

Antibodies of the invention are useful for, for example, therapeutic
purposes (by modulating activity of GPCR polypeptides), diagnostic purposes to
detect or quantitate GPCR polypeptides, as well as purification of GPCR
polypeptides. Kits comprising an antibody of the invention for any of the purposes
described herein are also comprehended. In general, a kit of the invention also
includes a control antigen for which the antibody is immunospecific.

Specific binding molecules, including natural ligands and synthetic
compounds, can be identified or developed using isolated or recombinant GPCR
polypeptide products, GPCR polypeptide variants, or preferably, cells expressing such
products. Binding partners are useful for purifying GPCR polypeptide products and
for detection or quantification of GPCR polypeptide products in fluid and tissue samples
using known immunological procedures. Binding molecules are also manifestly
useful in modulating (i.e., blocking, inhibiting or stimulating) biological activities of
GPCR polypeptides, especially those activities involved in signal transduction.

The DNA and amino acid sequence information provided by the 
present invention also makes possible identification of binding partner compounds 
with which a GPCR polypeptide or polynucleotide will interact. Methods to identify 
binding partner compounds include solution assays, in vitro assays wherein GPCR 
polypeptides are immobilized, and cell based assays. Identification of binding partner 
compounds of GPCR polypeptides provides candidates for therapeutic or prophylactic 
intervention in pathologies associated with GPCR polypeptide normal and aberrant 
biological activity. 

The invention includes several assay systems for identifying GPCR 
polypeptide binding partners. In solution assays, methods of the invention comprise 
the steps of (a) contacting a GPCR polypeptide with one or more candidate binding 
partner compounds and (b) identifying the compounds that bind to the GPCR 
polypeptide. Identification of the compounds that bind the GPCR polypeptide can be 
achieved by isolating the GPCR polypeptide/binding partner complex, and separating 
the GPCR polypeptide from the binding partner compound. An additional step of 
characterizing the physical, biological, and/or biochemical properties of the binding 
partner compound is also comprehended in another embodiment of the invention. In 
one aspect, the GPCR polypeptide/binding partner complex is isolated using an
antibody immunospecific for either the GPCR polypeptide or the candidate binding
partner compound. 

In still other embodiments, either the GPCR polypeptide or the 
candidate binding partner compound comprises a label or tag that facilitates its 
isolation, and methods of the invention to identify binding partner compounds include 
a step of isolating the GPCR polypeptide/binding partner complex through interaction 
with the label or tag. An exemplary tag of this type is a poly-histidine sequence, 
generally around six histidine residues, that permits isolation of a compound so 
labeled using nickel chelation. Other labels and tags, such as the FLAG tag
(Eastman Kodak, Rochester, NY), well known and routinely used in the art, are
embraced by the invention.

In one variation of an in vitro assay, the invention provides a method
comprising the steps of (a) contacting an immobilized GPCR polypeptide with a
candidate binding partner compound and (b) detecting binding of the candidate
compound to the GPCR polypeptide. In an alternative embodiment, the candidate
binding partner compound is immobilized and binding of GPCR polypeptide is
detected. Immobilization is accomplished using any of the methods well known in
the art, including covalent bonding to a support, a bead, or a chromatographic resin, as
well as non-covalent, high affinity interaction such as antibody binding, or use of
streptavidin/biotin binding wherein the immobilized compound includes a biotin
moiety. Detection of binding can be accomplished (i) using a radioactive label on the
compound that is not immobilized, (ii) using a fluorescent label on the
non-immobilized compound, (iii) using an antibody immunospecific for the
non-immobilized compound, or (iv) using a label on the non-immobilized compound that
excites a fluorescent support to which the immobilized compound is attached, as well
as by other techniques well known and routinely practiced in the art.

The invention also provides cell-based assays to identify binding
partner compounds of a GPCR polypeptide. In one embodiment, the invention
provides a method comprising the steps of contacting a GPCR polypeptide expressed
on the surface of a cell with a candidate binding partner compound and detecting
binding of the candidate binding partner compound to the GPCR polypeptide. In a
preferred embodiment, the detection comprises detecting a calcium flux or other
physiological cellular events caused by the binding of the molecule.

Agents that modulate (i.e., increase, decrease, or block) GPCR
polypeptide activity or expression may be identified by incubating a putative
modulator with a cell expressing a GPCR polypeptide or polynucleotide and
determining the effect of the putative modulator on GPCR polypeptide activity or
expression. The selectivity of a compound that modulates the activity of GPCR
polypeptides can be evaluated by comparing its effects on GPCR polypeptides to its
effect on other G protein-coupled receptors. Selective modulators may
include, for example, antibodies and other proteins, peptides, or organic molecules
which specifically bind to a G protein-coupled receptor polypeptide or a G
protein-coupled receptor-encoding nucleic acid. Modulators of GPCR polypeptide activity
will be therapeutically useful in treatment of diseases and physiological conditions in
which normal or aberrant GPCR polypeptide activity is involved.

Methods of the invention to identify modulators include variations on
any of the methods described above to identify binding partner compounds, the
variations including techniques wherein a binding partner compound has been
identified and the binding assay is carried out in the presence and absence of a
candidate modulator. A modulator is identified in those instances where binding
between the GPCR polypeptide and the binding partner compound changes in the
presence of the candidate modulator compared to binding in the absence of the
candidate modulator compound. A modulator that increases binding between the
GPCR polypeptide and the binding partner compound is described as an enhancer or
activator, and a modulator that decreases binding between the GPCR polypeptide and
the binding partner compound is described as an inhibitor.
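
The comparison described above can be sketched as a simple decision rule. The
Python example below is illustrative only: the function name, the use of a relative
change, and the 10% threshold for treating a difference as meaningful are
assumptions introduced here, and the binding values are hypothetical signal
readings in arbitrary units rather than data from the specification.

```python
def classify_modulator(binding_absent, binding_present, min_change=0.10):
    """Classify a candidate modulator from two binding measurements.

    binding_absent  -- binding signal without the candidate modulator
    binding_present -- binding signal with the candidate modulator
    min_change      -- assumed minimum relative change regarded as real
    """
    if binding_absent <= 0:
        raise ValueError("reference binding must be positive")
    relative_change = (binding_present - binding_absent) / binding_absent
    if relative_change > min_change:
        return "enhancer"   # also termed an activator in the text
    if relative_change < -min_change:
        return "inhibitor"
    return "no effect"


if __name__ == "__main__":
    # Hypothetical readings (arbitrary units) for three candidates.
    for name, without, with_ in [("cmpd-1", 1000, 1500),
                                 ("cmpd-2", 1000, 400),
                                 ("cmpd-3", 1000, 1020)]:
        print(name, classify_modulator(without, with_))
```

In a real screen the threshold would be set from replicate variability and controls;
the fixed value here only keeps the sketch self-contained.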

The invention also comprehends high throughput screening (HTS)
assays to identify compounds that interact with or inhibit biological activity (i.e.,
inhibit enzymatic activity, binding activity, etc.) of a GPCR polypeptide. HTS assays
permit screening of large numbers of compounds in an efficient manner. Cell-based
HTS systems are contemplated to investigate GPCR receptor-ligand interaction. HTS
assays are designed to identify "hits" or "lead compounds" having the desired
property, from which modifications can be designed to improve the desired property.
Chemical modification of the "hit" or "lead compound" is often based on an
identifiable structure/activity relationship between the "hit" and the GPCR
polypeptide.

Mutations in the GPCR gene that result in loss of normal function of
the GPCR gene product underlie GPCR polypeptide-related human disease states.
The invention comprehends gene therapy to restore activity to treat those disease
states. Delivery of a functional GPCR gene to appropriate cells is effected ex vivo, in
situ, or in vivo by use of vectors, and more particularly viral vectors (e.g., adenovirus,
adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA transfer
methods (e.g., liposomes or chemical treatments). See, for example, Anderson,
Nature, supplement to vol. 392, no. 6679, pp. 25-30 (1998). For additional reviews of
gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma,
Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992).
Alternatively, it is contemplated that in other human disease states, preventing the
expression of or inhibiting the activity of GPCR polypeptides of the invention will be
useful in treating the disease states. It is contemplated that antisense therapy or gene
therapy could be applied to negatively regulate the expression of GPCR polypeptides
of the invention.

Additional features of the invention will be apparent from the 
following Examples. 

EXAMPLE 1 

Cloning of G Protein-Coupled Receptors

The Incyte and Genbank expressed sequence tag (EST) databases were
searched with the NCBI program Blastall using either the transmembrane VI region of
known dopamine receptors (leading to the identification of CON193, CON166,
CON103 and CON203) or all known GPCRs except olfactory and opsin receptors
(leading to the identification of CON198, CON197, CON202, CON222, CON215) as
query sequences, to find patterns suggestive of novel G protein-coupled receptors.
Positive hits from the find-pattern program were further analyzed with the GCG
program BLAST to determine which ones were the most likely candidates to encode a
GPCR, using the standard (default) alignment produced by BLAST as a guide.

A. Cloning of CON193 G Protein-Coupled Receptor
A.1 Database Search Results

Searching identified Clone 3091220H1 in the Incyte database as an
interesting candidate sequence. The 3091220H1 Clone was obtained and sequenced
directly using an ABI377 fluorescence-based sequencer (Perkin-Elmer/Applied
Biosystems Division, PE/ABD, Foster City, CA) and the ABI PRISM™ Ready
Dye-Deoxy Terminator kit with Taq FS™ polymerase. Each ABI cycle sequencing
reaction contained about 0.5 µg of plasmid DNA. Cycle-sequencing was performed
using an initial denaturation at 98°C for 1 minute, followed by 50 cycles using the
following parameters: 98°C for 30 seconds, annealing at 50°C for 30 seconds, and
extension at 60°C for 4 minutes. Temperature cycles and times were controlled by a
Perkin-Elmer 9600 thermocycler. Extension products were purified using
Centriflex™ gel filtration cartridges (Advanced Genetic Technologies Corp.,
Gaithersburg, MD). Each reaction product was loaded by pipette onto the column,
which was then centrifuged in a swinging bucket centrifuge (Sorvall model RT6000B
tabletop centrifuge) at 1500 x g for 4 minutes at room temperature. Column-purified
samples were dried under vacuum for about 40 minutes and then dissolved in 5 µl of a
DNA loading solution (83% deionized formamide, 8.3 mM EDTA, and 1.6 mg/ml
Blue Dextran). The samples were then heated to 90°C for three minutes and loaded
into the gel sample wells for sequence analysis using the ABI377 sequencer.
Sequence analysis was done by importing ABI377 files into the Sequencher program
(Gene Codes, Ann Arbor, MI). Generally, sequence reads of 700 bp were obtained.
Potential sequencing errors were minimized by obtaining sequence information from
both DNA strands and by re-sequencing difficult areas using primers annealing at
different locations until all sequencing ambiguities were removed.

From the sequence it was deduced that Clone 3091220H1 contained only an
amino-terminal fragment of a putative GPCR corresponding to the third through the
seventh transmembrane regions (3TM-7TM) of a GPCR. Referring to SEQ ID NO: 1,
the nucleotide sequence of Clone 3091220H1 corresponds to nucleotides 404 to 1308
of what was eventually determined to be the complete sequence of a novel
seven-transmembrane receptor designated CON193. A database search with this partial
sequence showed a 56% match to members of the olfactory receptor gene family, e.g.,
the gene encoding mouse odorant receptor S19.

A.2 Screening of a Genomic Phage Library to Obtain a Full-Length GPCR 
Clone: 

The PCR technique was used to prepare a genomic fragment for use as
a probe specific for the genomic CON193 Clone. Based on the complete sequence of
Clone 3091220H1, two oligonucleotide primers were designed: Primer LW1282:
5'-TAATACCTGCACTGCCCAC-3' (SEQ ID NO: 21; see nucleotides 876-894 of SEQ
ID NO: 1) and Primer LW1283: 5'-TCTTTCCTTCTCTTCTCACTCC-3' (SEQ ID
NO: 22; see nucleotides 1137-1158 of SEQ ID NO: 1). These primers were designed
to amplify a 283 base-pair fragment of genomic DNA containing a portion of the
CON193 coding region found in Clone 3091220H1 (assuming the absence of introns
in this region).
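
The 283 base-pair figure follows directly from the primer coordinates given above,
counting both end positions. The Python sketch below works through that arithmetic
and also checks that each primer's length agrees with its stated coordinate span.
The tuple layout and function names are assumptions for illustration; the primer
sequences and coordinates are those quoted in the text, and the calculation assumes,
as the text does, that no intron lies between the primers.

```python
# (name, sequence 5'->3', first nucleotide, last nucleotide) on SEQ ID NO: 1
FORWARD = ("LW1282", "TAATACCTGCACTGCCCAC", 876, 894)
REVERSE = ("LW1283", "TCTTTCCTTCTCTTCTCACTCC", 1137, 1158)


def span_length(first, last):
    """Number of nucleotides from `first` to `last`, both inclusive."""
    return last - first + 1


def expected_amplicon_length(forward, reverse):
    """Length from the first forward-primer base to the last reverse-primer base."""
    return span_length(forward[2], reverse[3])


if __name__ == "__main__":
    for name, seq, first, last in (FORWARD, REVERSE):
        print(name, "length", len(seq), "span", span_length(first, last))
    print("expected product:", expected_amplicon_length(FORWARD, REVERSE), "bp")
```

Here 1158 - 876 + 1 = 283, matching the fragment size stated in the text.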

Initially, a suitable human genomic library constructed in EMBL3
SP6/T7 (Clontech Laboratories) was amplified to provide the materials required for
screening. Two microliters of the human genomic library (approximately 10** plaque-
forming units per milliliter; Clontech Laboratories, catalog number HL1067J) were
added to 6 ml of an overnight culture of K802 cells (Clontech Laboratories), and 250
µl aliquots were distributed into each of 24 tubes. The tubes were incubated at 37°C
for 15 minutes, and then 7 ml of 0.8% agarose (i.e., top agarose) at 50°C were added
to each tube. After mixing, the contents of the tubes were poured onto 150 mm LB
plates and incubated overnight at 37°C to allow clone amplification, evident as plaque
formation (typically, confluent lysis was observed rather than discrete plaques). To
each plate, 5 ml of SM phage buffer (0.1 M NaCl, 8.1 µM MgSO4·7H2O, 50 mM
Tris-HCl (pH 7.5), and 0.0001% gelatin) was added and the top agarose was removed
by scraping with a microscope slide. Top agarose slurries containing phage were then
placed in individual 50 ml centrifuge tubes. A drop of chloroform was added and
each tube was placed in a 37°C shaker for 15 minutes, followed by centrifuging at
2,750 x g for 15 minutes. The supernatants were isolated and separately stored at 4°C
as 24 stock solutions of amplified library clones.

As noted above, polymerase chain reaction (PCR) was selected as a
technique for screening the phage library. Each PCR reaction was done in a 20 µl
reaction volume containing 8.84 µl H2O, 2 µl 10X PCR buffer II (Perkin-Elmer), 2 µl
25 mM MgCl2, 0.8 µl dNTP mixture (dATP, dTTP, dGTP, dCTP, each at 10 mM),
0.12 µl primer LW1282 (approximately 1 µg/µl), 0.12 µl primer LW1283
(approximately 1 µg/µl), 0.12 µl AmpliTaq Gold polymerase (5 Units/µl, with "Units"
as defined by the supplier, Perkin-Elmer) and 2 µl of phage from one of the 24 stock
tubes. The PCR reaction involved 1 cycle at 95°C for 10 minutes and 80°C for 20
minutes, followed by 22 cycles at 95°C for 30 seconds, 72-51°C for 2 minutes (72°C
for this stage of the second cycle, with a decrease of one degree for this stage in each
succeeding cycle), 72°C for one minute, followed by 30 cycles at 95°C for 15
seconds, 50°C for 30 seconds, and 72°C for one minute.
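
The 22-cycle stage described above is a touchdown-style program in which the middle
(annealing) step starts at 72°C and drops by one degree per cycle until it reaches
51°C. The Python sketch below generates that schedule as a list of steps; the
function names and the tuple representation are assumptions for illustration, and
only the 22-cycle stage is modeled (the initial cycle and the final 30 cycles are
omitted).

```python
def touchdown_annealing_temperatures(start_c=72, end_c=51, step_c=1):
    """Annealing temperature for each cycle, decreasing from start to end."""
    if step_c <= 0 or start_c < end_c:
        raise ValueError("expected start_c >= end_c and a positive step")
    return list(range(start_c, end_c - 1, -step_c))


def touchdown_program(start_c=72, end_c=51):
    """Return one (temperature in C, seconds) triple of steps per cycle."""
    program = []
    for anneal_c in touchdown_annealing_temperatures(start_c, end_c):
        program.append([
            (95, 30),         # denaturation, 30 seconds
            (anneal_c, 120),  # annealing, 2 minutes, one degree lower each cycle
            (72, 60),         # extension, one minute
        ])
    return program


if __name__ == "__main__":
    program = touchdown_program()
    print(len(program), "cycles")
    print("first cycle:", program[0])
    print("last cycle: ", program[-1])
```

Stepping from 72°C down to 51°C in one-degree decrements gives exactly 22 annealing
temperatures, consistent with the 22 cycles stated in the text.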

Following PCR cycling, the contents from each reaction tube were
loaded onto a 2% agarose gel and electrophoresed adjacent to known size standards to
screen for PCR products of the expected size, indicative of a clone containing the 283
bp portion of Clone 3091220H1 amplified by the two selected primers. A positive
signal (i.e., a fragment of the expected size) was found in one of the 24 PCR
reactions, thereby identifying a single stock genomic library tube containing positive
clones.

From the original genomic library tube that had given a PCR product
of the correct size, a 5 µl phage aliquot was used to establish a set of five serial
dilutions (1/100, v/v) that were plated and incubated in the same manner as described
for the amplification of the phage library. Following incubation, BA85 nitrocellulose
filters (Schleicher & Schuell) were placed on top of each of the plates for 1 hour to
adsorb phage from the plaques that had formed in the top agarose during incubation.
Each filter was then gently removed, placed phage side up in an individual petri dish,
and covered with 4 ml of SM buffer for 15 minutes to elute the phage. One milliliter
of SM containing eluted phage was removed from each plate and used to set up a PCR
reaction as described above. The plate containing the most dilute phage solution to
yield a PCR product of the expected size was then subdivided using the following
procedure. A BA85 filter was placed on the top agar of the plate and the medium
with applied filter was physically divided into 24 sections. After one hour to allow
phage adsorption to the 24 filters, each filter was removed and separately incubated in
1 ml of SM buffer at room temperature for 15 minutes. Two microliters of each
eluted phage solution were then used as a PCR substrate. Those plate sections
yielding positive PCR results were then subdivided into 12 subsections by removing
the top agar and incubating it in 200 µl of SM buffer for one hour at room
temperature. Again, 2 µl of the eluted phage solutions were plated and lifted using
BA85 filters, and PCR reactions were repeated. The procedure for progressive
dilution of phage was continued until a single plaque was isolated. Subsequently, 10
µl of eluted phage from that single plaque were added to 100 µl SM and 200 µl of
K802 cells for plating in a single petri dish as described above. A total of 7 plates
were inoculated in this manner. Following incubation at 37°C for 16 hours, the top
agarose from each of the 7 plates was removed to recover the phage, which were used
to prepare purified genomic phage DNA using the Qiagen Lambda Midi Kit.

The purified CON193 genomic phage DNA was sequenced using the
ABI PRISM™ 310 Genetic Analyzer (Perkin-Elmer/Applied Biosystems) which uses
advanced capillary electrophoresis technology and the ABI PRISM™ BigDye™
Terminator Cycle Sequencing Ready Reaction Kit. The cycle-sequencing reaction
contained 18 µl of H2O, 16 µl of BigDye™ Terminator mix, 3 µl of genomic phage
DNA (0.26 µg/µl), and 3 µl primer (25 ng/µl). The reaction was performed in a
Perkin-Elmer 9600 thermocycler at 95°C for 5 minutes, followed by 75 cycles of
95°C for 30 seconds, 55°C for 20 seconds, and 60°C for 4 minutes. The final
subclone was also sequenced using the ABI PRISM™ 310 Genetic Analyzer. The
cycle-sequencing reaction contained 6 µl of H2O, 8 µl of BigDye™ Terminator mix, 5
µl of miniprep clone DNA (0.1 µg/µl), and 1 µl primer (25 ng/µl). The reaction was
performed in a Perkin-Elmer 9600 thermocycler for 25 cycles of 96°C for 10 seconds,
50°C for 10 seconds, and 60°C for 4 minutes. The product of the PCR reaction was
purified using Centriflex™ gel filtration cartridges, dried under vacuum, and
dissolved in 16 µl of Template Suppression Reagent (PE-Applied Biosystems). The
samples were then incubated at 95°C for 5 minutes and placed in the 310 Genetic
Analyzer. These efforts resulted in the determination of the CON193 polynucleotide
sequence set forth in SEQ ID NO: 1 and the deduced amino acid sequence of the
encoded CON193 polypeptide which is set forth in SEQ ID NO: 2.

A.3 Subcloning of the Coding Region of CON193 via PCR

Additional experiments were conducted to subclone the coding region
of CON193 and place the isolated coding region into a useful vector. Two additional
PCR primers were designed based on the coding region of CON193. The first PCR
primer, designated Primer LW1373, has the sequence
5'-GCATAAGGTTATGCTAAGACTGAATAAAAGAG-3' (SEQ ID NO: 23), nucleotides 11-32 of which
correspond to nucleotides 157-178 of SEQ ID NO: 1. The second PCR primer is
Primer LW1374, which has the sequence
5'-GCATCTCGAGTCACATGCTGTAGGATTTGG-3' (SEQ ID NO: 24), nucleotides 11-30 of which correspond
to the complement of nucleotides 1102-1121 of SEQ ID NO: 1. To protect against
exonucleolytic attack during subsequent exposure to enzymes, e.g., Taq polymerase,
primers were routinely synthesized with a protective run of nucleotides at the 5' end
that were not necessarily complementary to the desired target.

PCR was performed in a 50 µl reaction containing 35 µl H2O, 5 µl 10X
TT buffer (140 mM ammonium sulfate, 0.1% gelatin, 0.6 M Tris-tricine, pH 8.4), 5 µl
15 mM MgSO4, 2 µl dNTP mixture (dGTP, dATP, dTTP, and dCTP, each at 10 mM),
2 µl genomic phage DNA (0.26 µg/µl), 0.3 µl Primer LW1373 (1 µg/µl), 0.3 µl
Primer LW1374 (1 µg/µl), and 0.4 µl High Fidelity Taq polymerase (Boehringer
Mannheim). The PCR reaction was started with 1 cycle of 94°C for 2 minutes,
followed by 15 cycles at 94°C for 30 seconds, 55°C for 30 seconds, and 72°C for 1.3
minutes.
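
As a quick consistency check on the reaction recipe above, the listed component volumes can be summed and the stated stock concentrations converted to final concentrations. The Python sketch below uses only the volumes and concentrations given in the text; the helper name `summarize_mix` and the dictionary layout are illustrative assumptions.

```python
from typing import Dict, Tuple

# Volumes in microliters, as listed for the CON193 subcloning PCR.
COMPONENTS_UL: Dict[str, float] = {
    "H2O": 35.0,
    "10X TT buffer": 5.0,
    "15 mM MgSO4": 5.0,
    "dNTP mixture": 2.0,
    "genomic phage DNA": 2.0,
    "Primer LW1373": 0.3,
    "Primer LW1374": 0.3,
    "High Fidelity Taq polymerase": 0.4,
}

# Stock concentrations stated in the text (mM; the dNTP value is per nucleotide).
STOCK_MM: Dict[str, float] = {"15 mM MgSO4": 15.0, "dNTP mixture": 10.0}
TEMPLATE_UG_PER_UL = 0.26  # genomic phage DNA stock


def summarize_mix(
    components_ul: Dict[str, float], stock_mm: Dict[str, float]
) -> Tuple[float, Dict[str, float]]:
    """Return the total volume and the final concentration (mM) of each stock."""
    total = sum(components_ul.values())
    finals = {
        name: conc * components_ul[name] / total for name, conc in stock_mm.items()
    }
    return total, finals


total_ul, final_mm = summarize_mix(COMPONENTS_UL, STOCK_MM)
template_ug = COMPONENTS_UL["genomic phage DNA"] * TEMPLATE_UG_PER_UL

print(f"total volume: {total_ul:.1f} ul")
print(f"MgSO4: {final_mm['15 mM MgSO4']:.2f} mM, each dNTP: {final_mm['dNTP mixture']:.2f} mM")
print(f"template DNA: {template_ug:.2f} ug")
```

The components add up to the stated 50 µl, giving about 1.5 mM MgSO4, 0.4 mM of each dNTP, and 0.52 µg of template per reaction.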

The contents from the PCR reaction were loaded onto a 2% agarose
gel, fractionated and electroeluted. The DNA band of expected size was excised from
the gel, placed in a GenElute Agarose spin column (Supelco) and spun for 10 minutes
at maximum speed in a microcentrifuge. The eluted DNA was precipitated with
ethanol and resuspended in 6 µl H2O for ligation.

The PCR-amplified DNA fragment containing the CON193 coding
region was cloned into pCR2.1 using a protocol standard in the art. In particular, the
ligation reaction consisted of 6 µl of CON193 DNA, 1 µl 10X ligation buffer, 2 µl
pCR2.1 (25 ng/µl, Invitrogen), and 1 µl T4 DNA ligase (Invitrogen). The reaction
mixture was incubated overnight at 14°C and the reaction was then stopped by heating
at 65°C for 10 minutes. Two microliters of the ligation reaction were transformed
into One Shot cells (Invitrogen) and plated onto ampicillin plates. A single colony
containing an insert was used to inoculate a 5 ml culture of LB medium. The culture
was grown for 18 hours and the plasmid DNA was purified using the Concert Rapid
Plasmid Miniprep System (GibcoBRL) and sequenced. Following confirmation of the
sequence, pCR-CON193 was identified, and a 50 ml culture of LB medium was
inoculated and recombinant plasmid DNA was purified using a Qiagen Plasmid Midi
Kit to yield purified pCR-CON193.

B. Cloning of CON166 G Protein-Coupled Receptor
B.1 Database Search Results

The database searching identified clone 2553280H1 in the Incyte
database as an interesting candidate sequence. The 2553280H1 clone was obtained
and sequenced directly using an ABI377 fluorescence-based sequencer and the ABI
PRISM™ Ready Dye-Deoxy Terminator kit with Taq FS™ polymerase as described
above for CON193 in Example 1A.1. From the sequence it was deduced that clone
2553280H1 contained 349 nucleotides of a GPCR coding region comprising a
carboxy-terminal fragment of a putative GPCR corresponding to the sixth and seventh
transmembrane regions (6TM and 7TM). In addition, clone 2553280H1 contained
1.2 kb of the 3' untranslated sequence of that GPCR. Referring to SEQ ID NO: 3, the
nucleotide sequence of Clone 2553280H1 corresponds to nucleotides 663 to 1,014 of
what was eventually determined to be the complete sequence of a novel
seven-transmembrane receptor that was designated CON166. A database search with this
partial sequence showed a 44% match to an activated T cell-specific G
protein-coupled receptor.

B.2 Screening of a Genomic Phage Library to Obtain a
Full-Length GPCR Clone

The PCR technique was used to prepare a genomic fragment for use as
a probe specific for the genomic CON166 clone. Based on the complete sequence of
clone 2553280H1, two oligonucleotide primers were designed: Primer LW1278:
5'-ACCGCTGCCTTTTTAGTC-3' (SEQ ID NO: 28; see nucleotides 715 to 732 of SEQ
ID NO: 3) and Primer LW1279: 5'-CCTTCTTTCTGGGTACATAAGTC-3' (SEQ ID
NO: 29; see the reverse complement of nucleotides 951-973 of SEQ ID NO: 3).
These primers were designed to amplify a 259 base-pair fragment of genomic DNA
containing a portion of the CON166 coding region found in clone 2553280H1
(assuming the absence of introns in this region).
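
The 259 base-pair figure follows directly from the primer coordinates quoted above. The short Python sketch below reproduces that arithmetic and checks that each primer length matches the span it is said to cover; the function name `span_length` is an illustrative choice, and the calculation assumes, as the text does, that no intron interrupts the region.

```python
def span_length(first_nt: int, last_nt: int) -> int:
    """Length of an inclusive, 1-based nucleotide range."""
    if last_nt < first_nt:
        raise ValueError("last_nt must not precede first_nt")
    return last_nt - first_nt + 1


# Primer sequences and coordinates on SEQ ID NO: 3, as given in the text.
LW1278 = "ACCGCTGCCTTTTTAGTC"          # forward primer, nucleotides 715-732
LW1279 = "CCTTCTTTCTGGGTACATAAGTC"     # reverse primer, complement of 951-973
forward_site = (715, 732)
reverse_site = (951, 973)

# Each primer should be exactly as long as the range it anneals to.
forward_ok = len(LW1278) == span_length(*forward_site)
reverse_ok = len(LW1279) == span_length(*reverse_site)

# The amplicon runs from the first forward-primer base to the last reverse-primer base.
amplicon_bp = span_length(forward_site[0], reverse_site[1])
print(forward_ok, reverse_ok, amplicon_bp)
```

The same calculation applies to any primer pair whose positions on the template are known, provided both primers anneal without 5' extensions.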

Initially, a suitable human genomic library constructed in EMBL
SP6/T7 was amplified to provide the materials required for screening as described
above for CON193 in Example 1A.2. Polymerase chain reaction (PCR) was selected
as a technique for screening the phage library. Each PCR reaction was done in a 20 µl
reaction volume containing 8.84 µl H2O, 2 µl 10X PCR buffer II (Perkin-Elmer), 2 µl
25 mM MgCl2, 0.8 µl dNTP mixture (dATP, dTTP, dGTP, dCTP, each at 10 mM),
0.12 µl primer LW1278 (approximately 1 µg/µl), 0.12 µl primer LW1279
(approximately 1 µg/µl), 0.12 µl AmpliTaq Gold polymerase (5 Units/µl, with "Units"
as defined by the supplier, Perkin-Elmer) and 2 µl of phage from one of the 24 stock
tubes. The PCR reaction involved 1 cycle at 95°C for 10 minutes and 80°C for 20
minutes, followed by 12 cycles at 95°C for 30 seconds, 72-61°C for 2 minutes (72°C
for this stage of the second cycle, with a decrease of one degree for this stage in each
succeeding cycle), 72°C for 30 seconds, followed by 30 cycles at 95°C for 15 seconds,
60°C for 30 seconds, and 72°C for 30 seconds.
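
The stepped annealing stage described above (72°C falling by one degree per cycle to 61°C) can be written out explicitly. The Python sketch below assumes that the 72-61°C range is spread evenly over the 12 cycles, one value per cycle; the function name `touchdown_temps` and the tuple layout of the steps are illustrative assumptions.

```python
from typing import List, Tuple


def touchdown_temps(start_c: int, end_c: int, step_c: int = 1) -> List[int]:
    """Annealing temperatures from start_c down to end_c in steps of step_c."""
    if step_c <= 0 or end_c > start_c:
        raise ValueError("need step_c > 0 and end_c <= start_c")
    return list(range(start_c, end_c - 1, -step_c))


# One annealing temperature per cycle of the stepped stage.
anneal_c = touchdown_temps(72, 61)

# Each cycle: 95 C for 30 s, stepped annealing for 120 s, 72 C for 30 s.
stepped_stage: List[List[Tuple[int, int]]] = [
    [(95, 30), (temp, 120), (72, 30)] for temp in anneal_c
]

print(len(anneal_c), anneal_c[0], anneal_c[-1])
```

Under that reading the stage spans exactly 12 cycles, which agrees with the cycle count given in the text.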

Following PCR cycling, the contents from each reaction tube were
loaded onto a 2% agarose gel and electrophoresed adjacent to known size standards to
screen for PCR products of the expected size of 259 bp, indicative of a clone
containing the portion of clone 2553280H1 amplified by the two selected primers. A
positive signal (i.e., a fragment of the expected size) was found in one of the 24 PCR
reactions, thereby identifying a single stock genomic library tube containing positive
clones.

From the original genomic library tube that had given a PCR product
of the correct size, a 5 µl phage aliquot was used to amplify the CON166 genomic
phage DNA as described for CON193 above in Example 1A.2. For the amplification
of the phage library, the plates containing the diluted phage solution were subdivided
into 12 sections, unlike the plates for CON193; otherwise the procedures were identical.

The purified CON166 genomic phage DNA was sequenced using the
ABI PRISM™ 310 Genetic Analyzer, which uses advanced capillary electrophoresis
technology, and the ABI PRISM™ BigDye™ Terminator Cycle Sequencing Ready
Reaction Kit as described above for CON193 in Example 1A.2. These efforts
resulted in the determination of the CON166 polynucleotide sequence set forth in
SEQ ID NO: 3 and the deduced amino acid sequence of the encoded CON166
polypeptide, which is set forth in SEQ ID NO: 4.

B.3 Subcloning of the Coding Region of CON166 via PCR

Additional experiments were conducted to subclone the coding region
of CON166 from the genomic clone and place the isolated coding region into a useful
vector. Two additional PCR primers were designed based on the coding region of
CON166. The first PCR primer, designated Primer LW1405, has the sequence
5'-AAGCATAACATGGATGAAACAGGAAATCTG-3' (SEQ ID NO: 29,
nucleotides 10-30 of which correspond to nucleotides 1-21 of SEQ ID NO: 3). To
protect against exonucleolytic attack during subsequent exposure to enzymes, e.g.,
Taq polymerase, primers were routinely synthesized with a protective run of
nucleotides at the 5' end that were not necessarily complementary to the desired
target. The second PCR primer is Primer LW1406, which has the sequence
5'-AAGCATAACTATACTTTACATATTTCTTC-3' (SEQ ID NO: 30, nucleotides 9-29
of which correspond to the reverse complement of nucleotides 994-1014 of SEQ ID
NO: 3).

PCR was performed in a 50 µl reaction containing 34 µl H2O, 5 µl 10X
TT buffer (140 mM ammonium sulfate, 0.1% gelatin, 0.6 M Tris-tricine, pH 8.4), 5 µl
15 mM MgSO4, 2 µl dNTP mixture (dGTP, dATP, dTTP, and dCTP, each at 10 mM),
3 µl genomic phage DNA (0.25 µg/µl), 0.3 µl Primer LW1405 (1 µg/µl), 0.3 µl
Primer LW1406 (1 µg/µl), and 0.4 µl High Fidelity Taq polymerase (Boehringer
Mannheim). The PCR reaction was started with 1 cycle of 94°C for 2 minutes,
followed by 25 cycles at 94°C for 30 seconds, 55°C for 30 seconds, and 72°C for 1.3
minutes.

The contents from the PCR reaction were loaded onto a 2% agarose gel
and fractionated. The DNA band of expected size (1,031 bp) was excised from the
gel, placed in a GenElute Agarose spin column (Supelco) and spun for 10 minutes at
maximum speed in a microfuge. The eluted DNA was precipitated with ethanol and
resuspended in 6 µl H2O for ligation.
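
The expected band size of 1,031 bp can be reproduced from the primer descriptions for LW1405 and LW1406: the amplified target (nucleotides 1-1014 of SEQ ID NO: 3) plus the non-annealing 5' extension of each primer. The Python sketch below shows this arithmetic; the function name `tailed_amplicon_length` and its parameters are illustrative assumptions, and the "first annealing position" values are taken from the nucleotide ranges quoted for each primer.

```python
def tailed_amplicon_length(
    target_first: int,
    target_last: int,
    fwd_first_annealing_nt: int,
    rev_first_annealing_nt: int,
) -> int:
    """Amplicon length when both primers carry a non-annealing 5' extension.

    The *_first_annealing_nt arguments are 1-based positions within each
    primer at which the target-matching portion begins.
    """
    target_len = target_last - target_first + 1
    fwd_tail = fwd_first_annealing_nt - 1
    rev_tail = rev_first_annealing_nt - 1
    return target_len + fwd_tail + rev_tail


LW1405 = "AAGCATAACATGGATGAAACAGGAAATCTG"  # nucleotides 10-30 match the target
LW1406 = "AAGCATAACTATACTTTACATATTTCTTC"   # nucleotides 9-29 match the target

expected_bp = tailed_amplicon_length(1, 1014, 10, 9)
print(len(LW1405), len(LW1406), expected_bp)
```

With a 9-nucleotide extension on LW1405 and an 8-nucleotide extension on LW1406, the 1,014-nucleotide target yields the stated 1,031 bp product.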

The PCR-amplified DNA fragment containing the CON166 coding
region was cloned into pCR2.1 to generate pCR-CON166 using a protocol standard in
the art. In particular, the ligation reaction was carried out as described for CON193 in
Example 1A.3. The resulting plasmid DNA was purified using the Concert Rapid
Plasmid Miniprep System (GibcoBRL) and sequenced. Following confirmation of the
sequence, a 50 ml culture of LB medium was inoculated with the transformed One
Shot cells, cultured, and processed using a Qiagen Plasmid Midi Kit to yield purified
pCR-CON166.

C. Cloning of CON103 G Protein-Coupled Receptor 
C.1 Database Search Results

The database searching identified clone 1581220H1 in the Incyte
database as an interesting candidate sequence. The 1581220H1 clone was obtained
and sequenced directly using an ABI377 fluorescence-based sequencer and the ABI
PRISM™ Ready Dye-Deoxy Terminator kit with Taq FS™ polymerase as described
for CON193 in Example 1A.1. From the sequence it was deduced that clone
1581220H1 contained 454 nucleotides of a GPCR coding region comprising a
carboxy-terminal fragment of a putative GPCR corresponding to the sixth and seventh
transmembrane regions (6TM and 7TM). In addition, clone 1581220H1 contained
1.2 kb of the 3' untranslated sequence of that GPCR. Referring to SEQ ID NO: 5, the
nucleotide sequence of clone 1581220H1 corresponds to nucleotides 698 to 1190 of
what was eventually determined to be the complete sequence of a novel
seven-transmembrane receptor designated CON103. A database search with this partial
sequence showed a 44% match to an activated T cell-specific G protein-coupled
receptor.

C.2 Screening of a Genomic Phage Library
to Obtain a Full-Length GPCR Clone

The PCR technique was used to prepare a genomic fragment for use as
a probe specific for the genomic CON103 clone. Based on the complete sequence of
clone 1581220H1, two oligonucleotide primers were designed: Primer LW1280:
5'-TCTGCACACAGCTCTTCCATGG-3' (SEQ ID NO: 32; see nucleotides 1568-1589
of SEQ ID NO: 5) and Primer LW1281: 5'-TCCCTTGTCCAGTTGGTTGAGG-3'
(SEQ ID NO: 33; see nucleotides 1926 to 1947 of SEQ ID NO: 5). These primers
were designed to amplify a 380 base-pair fragment of genomic DNA containing a
portion of the CON103 coding region found in clone 1581220H1 (assuming the
absence of introns in this region).

Initially, a suitable human genomic library constructed in EMBL
SP6/T7 was amplified to provide the materials required for screening as described
above for CON193 in Example 1A.2. Polymerase chain reaction (PCR) was selected
as a technique for screening the phage library. Each PCR reaction was done in a 20 µl
reaction volume containing 8.84 µl H2O, 2 µl 10X PCR buffer II (Perkin-Elmer), 2 µl
25 mM MgCl2, 0.8 µl dNTP mixture (dATP, dTTP, dGTP, dCTP, each at 10 mM),
0.12 µl primer LW1280 (approximately 1 µg/µl), 0.12 µl primer LW1281
(approximately 1 µg/µl), 0.12 µl AmpliTaq Gold polymerase (5 Units/µl, with "Units"
as defined by the supplier, Perkin-Elmer) and 2 µl of phage from one of the 24 stock
tubes. PCR amplification reactions using each one of the other 23 stock collections of
genomic clones were performed under the same conditions. The PCR reaction
involved 1 cycle at 95°C for 10 minutes and 80°C for 20 minutes, followed by 12
cycles at 95°C for 30 seconds, 72-61°C for 2 minutes (72°C for this stage of the
second cycle, with a decrease of one degree for this stage in each succeeding cycle),
72°C for one minute, followed by 30 cycles at 95°C for 15 seconds, 60°C for 30
seconds, and 72°C for 30 seconds.

Following PCR cycling, the contents from each reaction tube were
loaded onto a 2% agarose gel and electrophoresed adjacent to known size standards to
screen for PCR products of the expected size of 380 bp, indicative of a clone
containing the portion of clone 1581220H1 amplified by the two selected primers. A
positive signal (i.e., a fragment of the expected size) was found in one of the 24 PCR
reactions, thereby identifying a single stock genomic library tube containing positive
clones.

From the original genomic library tube that had given a PCR product
of the correct size, a 5 µl phage aliquot was used to amplify the CON103 genomic
phage DNA as described above for CON193 in Example 1A.2. A total of 8 plates
were inoculated with eluted phage in the manner described above. Following
incubation at 37°C for 16 hours, the top agarose from each of the 8 plates was
removed to recover the phage, which were used to prepare purified genomic phage
DNA using the Qiagen Lambda Midi Kit.

The CON103 clone was sequenced using the ABI PRISM™ 310
Genetic Analyzer. The cycle-sequencing reaction contained 6 µl of H2O, 8 µl of
BigDye™ Terminator mix, 5 µl of miniprep clone DNA (0.1 µg/µl), and 1 µl primer
(25 ng/µl). The reaction was performed in a Perkin-Elmer 9600 thermocycler at 25
cycles of 96°C for 10 seconds, 50°C for 10 seconds, and 60°C for 4 minutes. The
product of the PCR reaction was purified using Centriflex™ gel filtration cartridges,
dried under vacuum, and dissolved in 16 µl of Template Suppression Reagent
(PE-Applied Biosystems). The samples were then incubated at 95°C for 5 minutes and
placed in the 310 Genetic Analyzer. These efforts resulted in the determination of the
CON103 polynucleotide sequence set forth in SEQ ID NO: 5 and the deduced amino
acid sequence of the encoded CON103 polypeptide, which is set forth in SEQ ID NO: 6.

C.3 Subcloning of the Coding Region of CON103 via PCR

Additional experiments were conducted to subclone the coding region
of CON103 from the genomic clone and place the isolated coding region into a useful
vector. Two additional PCR primers were designed based on the sequence of the
coding region of CON103: Primer LW1385
(5'-GGATAAGGTTGCATGGAAGTTGATAAGGTG-3'; SEQ ID NO: 34, nucleotides 13-30 of which
correspond to nucleotides 1-18 of SEQ ID NO: 5) and Primer LW1386
(5'-GGATGTCGAGTTAGGGGGAGAGGGGTGGAG-3'; SEQ ID NO: 35, nucleotides
11-30 of which correspond to the reverse complement of nucleotides 1171-1190 of
SEQ ID NO: 5). To protect against exonucleolytic attack during subsequent exposure
to enzymes, e.g., Taq polymerase, primers were routinely synthesized with a
protective run of nucleotides at the 5' end that were not necessarily complementary to
the desired target.

PCR was performed in a 50 µl reaction containing 22.6 µl H2O, 5 µl
10X TT buffer (140 mM ammonium sulfate, 0.1% gelatin, 0.6 M Tris-tricine, pH 8.4),
5 µl 15 mM MgSO4, 10 µl rapid dye (Origene), 2 µl dNTP mixture (dGTP, dATP,
dTTP, and dCTP, each at 10 mM), 0.5 µl genomic phage DNA (0.97 µg/µl), 0.3 µl
Primer LW1385 (1 µg/µl), 0.3 µl Primer LW1386 (1 µg/µl), and 0.4 µl High Fidelity
Taq polymerase (Boehringer Mannheim). The PCR reaction was started with 1 cycle
of 94°C for 2 minutes, followed by 12 cycles at 94°C for 30 seconds, 55°C for 30
seconds, and 72°C for 1.3 minutes.

The contents from the PCR reaction were loaded onto a 2% agarose gel
and fractionated. The DNA band of expected size (1,212 bp) was excised from the
gel, placed in a GenElute Agarose spin column (Supelco) and spun for 10 minutes at
maximum speed in a microcentrifuge. The eluted DNA was precipitated with ethanol
and resuspended in 6 µl H2O for ligation.

The PCR-amplified DNA fragment containing the CON103 coding
region was cloned into pCR2.1 using a protocol standard in the art. In particular, the
ligation reaction was carried out as described above for CON193 in Example 1A.3.
The resulting plasmid DNA was purified using the Concert Rapid Plasmid Miniprep
System (GibcoBRL) and sequenced. Following confirmation of the sequence,
pCR-CON103 was identified, and a 50 ml culture of LB medium was inoculated, cultured,
and processed using a Qiagen Plasmid Midi Kit to yield purified pCR-CON103.

D. Cloning of CON203 G Protein-Coupled Receptor 
D.1 Database Search Results

The database searching identified clone 3210396H1 in the Incyte
database as an interesting candidate sequence. The 3210396H1 clone was obtained
and sequenced directly using an ABI377 fluorescence-based sequencer and the ABI
PRISM™ Ready Dye-Deoxy Terminator kit with Taq FS™ polymerase as described
above for CON193 in Example 1A.1. From the sequence it was deduced that clone
3210396H1 contained all 1,002 nucleotides of a GPCR coding region (see SEQ ID
NO: 7). A database search with this sequence showed a 33% match to a platelet
activating receptor (Gene H963, GenBank Acc. No. AF002986).

D.2 Subcloning of the Coding Region of CON203 via PCR 

Additional experiments were conducted to subclone the coding region
of CON203 and place the isolated coding region into a useful vector. Two additional
PCR primers were designed based on the sequence of the coding region of CON203:
Primer LW1329: 5'-GCATCTCGAGTCAGCCTAAGGTTATGTTG-3' (SEQ ID
NO: 36; see nucleotides 984 to 1,002 of SEQ ID NO: 7 for the reverse complement of
nucleotides 9-29 of SEQ ID NO: 36) and Primer LW1377:
5'-GCATAAGCTTATGAACACCACAGTGATGC-3' (SEQ ID NO: 37; see
nucleotides 1-19 of SEQ ID NO: 7 which correspond to nucleotides 11-29 of SEQ ID
NO: 37). To protect against exonucleolytic attack during subsequent exposure to
enzymes, e.g., Taq polymerase, primers were routinely synthesized with a protective
run of nucleotides at the 5' end that were not necessarily complementary to the
desired target. These primers were designed to amplify a 1,020 base-pair fragment of
clone 3210396H1 containing the complete coding region of CON203.

PCR was performed in a 50 µl reaction containing 34 µl H2O, 5 µl 10X
TT buffer (140 mM ammonium sulfate, 0.1% gelatin, 0.6 M Tris-tricine, pH 8.4), 5 µl
15 mM MgSO4, 2 µl dNTP mixture (dGTP, dATP, dTTP, and dCTP, each at 10 mM),
3 µl clone 3210396H1 (miniprep DNA), 0.3 µl Primer LW1329 (1 µg/µl), 0.3 µl
Primer LW1377 (1 µg/µl), and 0.4 µl High Fidelity Taq polymerase (Boehringer
Mannheim). The PCR reaction was started with 1 cycle of 94°C for 2 minutes,
followed by 12 cycles at 94°C for 30 seconds, 55°C for 30 seconds, and 72°C for 1.3
minutes.

The contents from the PCR reaction were loaded onto a 1.2% agarose
gel and fractionated. The DNA band of expected size (1,020 bp) was excised from
the gel, placed in a GenElute Agarose spin column (Supelco) and spun for 10 minutes
at maximum speed in a microcentrifuge. The eluted DNA was precipitated with
ethanol and resuspended in 6 µl H2O for ligation.

The PCR-amplified DNA fragment containing the CON203 coding
region was cloned into pCR2.1 using a standard protocol and the Original TA Cloning
Kit (Invitrogen). Ligation reactions were carried out as described above for CON193
in Example 1A.3. The resulting plasmid DNA was purified using the Concert Rapid
Plasmid Miniprep System (GibcoBRL) and sequenced. Following confirmation of the
sequence, pCR-C203 was identified, and a 50 ml culture of LB medium was
inoculated, cultured, and processed using a Qiagen Plasmid Midi Kit to yield purified
pCR-C203.

The CON203 clone was sequenced using the ABI PRISM™ 310
Genetic Analyzer (P-E Applied Biosystems), which uses advanced capillary
electrophoresis technology, and the ABI PRISM™ BigDye™ Terminator Cycle
Sequencing Ready Reaction Kit. The cycle-sequencing reaction contained 6 µl of
H2O, 8 µl of BigDye™ Terminator mix, 5 µl of miniprep clone DNA (0.1 µg/µl), and
1 µl primer (25 ng/µl). The reaction was performed in a Perkin-Elmer 9600
thermocycler using the following conditions: 25 cycles of 96°C for 10 seconds, 50°C
for 10 seconds, and 60°C for 4 minutes. The product of the PCR reaction was
purified using Centriflex™ gel filtration cartridges, dried under vacuum, and
dissolved in 16 µl of Template Suppression Reagent (PE-Applied Biosystems). The
samples were then incubated at 95°C for 5 minutes and placed in the 310 Genetic
Analyzer.

Initially, these efforts showed that the CON203 coding region cloned
into pCR2.1 had a single bp difference from the corresponding sequence of clone
3210396H1. The single bp change in the pCR2.1 clone was eliminated by
conforming that sequence to the sequence of clone 3210396H1 using the QuikChange
Site-Directed Mutagenesis Kit (Stratagene). The method involves modification of a
sequence during PCR amplification, for which PCR primers LW1387
(5'-GAGAAATATTTTTCTAAAAAAACCTGTTTTTGCAAAAACGG-3'; SEQ ID
NO: 38) and LW1388
(5'-CCGTTTTTGCAAAAACAGGTTTTTTTAGAAAAATATTTCTC-3'; SEQ ID NO: 39) were used.
The PCR reaction contained 40 µl
H2O, 5 µl 10X proprietary Reaction Buffer (Stratagene), 1 µl pCR-C203 (0.125 µg/µl)
mini-prep DNA, 1 µl dNTP mixture (dGTP, dATP, dTTP, and dCTP, each at 10
mM), 1 µl Pfu DNA polymerase (2.5 Units/µl), 1 µl LW1387 (125 ng/µl) and 1 µl
LW1388 (125 ng/µl). The cycle conditions were 95°C for 30 seconds, followed by 12
cycles at 95°C for 30 seconds, 55°C for 1 minute, and 68°C for 12 minutes. The tube
was then placed on ice for 2 minutes and 1 µl of DpnI was added. The tube was then
incubated at 37°C for one hour. One microliter of the DpnI-treated DNA was
transformed into Epicurian Coli XL1-Blue supercompetent E. coli cells. Following
isolation of pCR-C203, the entire insert was re-sequenced, thereby successfully
verifying repair of the single-site polymorphism. As expected, the sequence of the
CON203 coding region determined using this pCR2.1 clone is in complete agreement
with the CON203 coding region sequence of SEQ ID NO: 7, which specifies the
amino acid sequence set forth in SEQ ID NO: 8.
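
The two mutagenic primers used for the CON203 repair, LW1387 and LW1388, are exact reverse complements of one another, so that each anneals to one strand of the plasmid at the same site. The Python sketch below checks that relationship for the sequences quoted above; the helper name `reverse_complement` is an illustrative choice, and the function handles only the unambiguous bases A, C, G and T.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G and T."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return seq.translate(_COMPLEMENT)[::-1]


# Primer sequences as given in the text (SEQ ID NOs: 38 and 39).
LW1387 = "GAGAAATATTTTTCTAAAAAAACCTGTTTTTGCAAAAACGG"
LW1388 = "CCGTTTTTGCAAAAACAGGTTTTTTTAGAAAAATATTTCTC"

primers_pair_up = reverse_complement(LW1387) == LW1388
print(len(LW1387), len(LW1388), primers_pair_up)
```

A transcription error in either primer would break this equality, which makes the check a convenient way to proofread a complementary primer pair.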

E. Cloning of CON198 G Protein-Coupled Receptor
E.1 Database Search Results

The database searching identified Clone 3359808H1 in the Incyte
database as an interesting candidate sequence. The 3359808H1 clone was obtained
and sequenced using standard techniques. From the sequence it was deduced that
Clone 3359808H1 contained the entire coding region for a previously unidentified
GPCR, which was designated "CON198." The DNA and deduced amino acid
sequences for CON198 are set forth in SEQ ID NOS: 9 and 10, respectively. A
database search with this CON198 DNA sequence showed a 61% match to the rat
putative GPCR designated RA1c [Raming et al., Recept. Channels, 6: 141-151
(1998)] and 46% identity to an olfactory receptor.

E.2 Subcloning of the Coding Region of CON198 via PCR

Additional experiments were conducted to subclone the coding region
of the CON198 clone into a useful vector. Two PCR primers were designed based on
the coding region of CON198 for the purpose of PCR amplification of the CON198
coding sequence. The first, Primer LW1326, from 5' to 3' (SEQ ID NO: 42):
GCATGAATTC ATGATGGTGGATCCCAATGG, includes the 5' end of the
CON198 coding sequence (underlined) as well as an EcoRI restriction site, useful for
subsequent expression work. The second, Primer LW1327, from 5' to 3' (SEQ ID
NO: 43): GCATCTCGAGC CTAGGGCTCTGAAGCG, includes sequence
complementary to the 3' end of the CON198 coding sequence (underlined), preceded
by a XhoI restriction site sequence useful for subsequent cloning and expression work.
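
The restriction sites built into the 5' ends of primers LW1326 and LW1327 can be located by a simple string search. The Python sketch below scans each primer for the EcoRI (GAATTC) and XhoI (CTCGAG) recognition sequences; the function name `find_sites` and the small site table are illustrative assumptions, and the primer strings are the sequences quoted above with the internal space removed.

```python
from typing import Dict, List

# Recognition sequences of the two enzymes named in the text.
SITES: Dict[str, str] = {"EcoRI": "GAATTC", "XhoI": "CTCGAG"}


def find_sites(seq: str, sites: Dict[str, str] = SITES) -> Dict[str, List[int]]:
    """Return 1-based start positions of each recognition sequence found in seq."""
    seq = seq.upper()
    hits: Dict[str, List[int]] = {}
    for name, motif in sites.items():
        positions = [
            i + 1
            for i in range(len(seq) - len(motif) + 1)
            if seq[i:i + len(motif)] == motif
        ]
        if positions:
            hits[name] = positions
    return hits


LW1326 = "GCATGAATTCATGATGGTGGATCCCAATGG"
LW1327 = "GCATCTCGAGCCTAGGGCTCTGAAGCG"

print(find_sites(LW1326))
print(find_sites(LW1327))
```

In both primers the site begins at the fifth nucleotide, leaving four protective bases ahead of it.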

The PCR was performed in a 50 µl reaction containing 34 µl H2O, 5 µl
of 10X TT buffer (140 mM Ammonium Sulfate, 0.1% gelatin, 0.6 M Tris-tricine, pH
8.4), 5 µl of 15 mM MgSO4, 2 µl of 10 mM dNTPs (dATP, dCTP, dTTP, dGTP), 2 µl
of Clone 3359808H1 mini-prep DNA (approx. 0.125 µg/µl), 0.3 µl of Primer
LW1326 (1 µg/µl), 0.3 µl of Primer LW1327 (1 µg/µl), and 0.5 µl of High Fidelity
Taq polymerase (Boehringer Mannheim). The PCR reaction was started with 1 cycle
of 94°C for 2 minutes, followed by 12 cycles at 94°C for 30 seconds, 55°C for 30
seconds, and 72°C for 1 minute.

The contents from the PCR reaction were loaded onto a 1.2% agarose
gel and electrophoresed. The DNA band of expected size was excised from the gel,
placed in a GenElute Agarose spin column (Supelco) and spun for 10 minutes at
maximum speed in a microcentrifuge. The eluted DNA was ethanol-precipitated and
resuspended in 6 µl H2O for ligation.

The purified PCR fragment containing the CON198 coding sequence
was ligated into a commercial vector using Invitrogen's Original TA Cloning Kit. The
ligation reaction was carried out as described above for CON193 in Example 1A.3.
The resulting plasmid DNA was isolated using a Concert Rapid Plasmid Miniprep
System (GibcoBRL) and sequenced to confirm that the plasmid contained the
CON198 insert. Sequencing of the subcloned CON198 construct revealed that the
PCR amplification had introduced a mutation (relative to the sequence of the original
clone) at the nucleotide corresponding to position 204 of SEQ ID NO: 9. A
site-directed mutagenesis experiment was performed using the QuikChange
Site-Directed Mutagenesis Kit (Stratagene) to repair the mutation.

Two primers were designed to revert the mutated A nucleotide at
position 204 back to a G nucleotide via polymerase chain reaction. Primer LW1415
(SEQ ID NO: 44) contained the sequence:
5'-CCATGTATATATTTCTTTGCATGCTTTCAGGCATTGACATCC-3'; and
primer LW1416 (SEQ ID NO: 45) contained the sequence:
5'-GGATGTCAATGCCTGAAAGCATGCAAAGAAATATATACATGG-3'. The
PCR reaction contained 40 µl of H2O, 5 µl of 10X Reaction buffer, 1 µl of mini-prep
DNA (approx. 0.125 µg/µl) from the CON198-pCR2.1 clone (as template), 1 µl of
primer LW1415 (125 ng/µl), 1 µl of primer LW1416 (125 ng/µl), 1 µl of 10 mM
dNTPs, and 1 µl Pfu DNA polymerase. The PCR cycle conditions were as follows: initial
denaturation at 95°C for 30 seconds, then 14 cycles at 95°C for 30 seconds, 55°C
annealing for 1 minute, and 68°C extension for 12 minutes. Thereafter, the reaction
tube was placed on ice for 2 minutes.

After PCR, 1 µl of DpnI was added and the tube incubated at 37°C for
one hour to digest the methylated parental DNA template. One microliter of the
DpnI-treated DNA was transformed into Epicurian Coli XL1-Blue supercompetent
cells and the entire insert was re-sequenced. The resequencing confirmed that
position 204 of SEQ ID NO: 9 had been successfully reverted to a guanine nucleotide.
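
Because the 12-minute extension dominates the mutagenesis program, it can be useful to total the programmed hold times for the CON198 cycling conditions given above. The Python sketch below does this for the stated steps only; it ignores temperature ramping between steps, and the function name `program_hold_seconds` and the step representation are illustrative assumptions.

```python
from typing import List, Tuple

Step = Tuple[int, int]  # (temperature in degrees C, hold time in seconds)


def program_hold_seconds(initial: List[Step], cycle: List[Step], n_cycles: int) -> int:
    """Sum of programmed hold times: initial steps plus n_cycles repeats of the cycle."""
    initial_s = sum(seconds for _, seconds in initial)
    cycle_s = sum(seconds for _, seconds in cycle)
    return initial_s + n_cycles * cycle_s


# Conditions stated for the CON198 mutagenesis reaction.
initial_steps: List[Step] = [(95, 30)]
cycle_steps: List[Step] = [(95, 30), (55, 60), (68, 720)]

total_s = program_hold_seconds(initial_steps, cycle_steps, 14)
print(total_s, total_s / 60)
```

The 14 cycles account for 11,340 of the 11,370 programmed seconds, or roughly 3 hours and 10 minutes before ramp times are added.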

Upon confirmation of the insert, the E. coli transformant was used to
inoculate a 50 ml culture of LB medium. The culture was grown for 16 hours at
37°C, and centrifuged into a cell pellet. Plasmid DNA was purified from the pellet
using a Qiagen Plasmid Midi Kit and again sequenced to confirm successful cloning
of the CON198 insert, using an ABI377 fluorescence-based sequencer and the ABI
PRISM™ Ready Dye-Deoxy Terminator kit with Taq FS™ polymerase as described
above for CON193 in Example 1A.1.

F. Cloning of CON197 G Protein-Coupled Receptor 
F.1 Database Search Results

The database searching identified Clone 866390H1 in the Incyte
database as an interesting candidate sequence. The 866390H1 clone was obtained and
sequenced using standard techniques. From the sequence it was deduced that Clone
866390H1 contained the entire coding region for a previously unidentified GPCR,
which was designated "CON197." The DNA and deduced amino acid sequences for
CON197 are set forth in SEQ ID NOs: 11 and 12, respectively. A database search
with this CON197 DNA sequence showed a 42% match to an olfactory receptor.

F.2 Subcloning of the Coding Region of CON197 via PCR

Additional experiments were conducted to subclone the coding region
of the CON197 clone into a useful vector. Two PCR primers were designed based on
the coding region of CON197 for the purpose of PCR amplification of the CON197
coding sequence. The first, Primer LW1324, from 5' to 3' (SEQ ID NO: 48):
GATCGGATCC ATGGAAAGCGAGAACAG, includes the 5' end of the CON197
coding sequence (underlined) as well as a BamHI restriction site, useful for
subsequent expression work. The second, Primer LW1325, from 5' to 3' (SEQ ID
NO: 49): GATCCTCGAGTCA GGCTATGTGCTTATTAAACACC, includes
sequence complementary to the 3' end of the CON197 coding sequence (underlined),
preceded by a XhoI restriction site sequence useful for subsequent cloning and
expression work.

The PCR was performed in a 50 µl reaction containing 24 µl H2O,
10 µl Rapid Dye Loading buffer (Origene), 5 µl 10X TT buffer (140 mM Ammonium
Sulfate, 0.1% gelatin, 0.6 M Tris-tricine, pH 8.4), 5 µl of 15 mM MgSO4, 2 µl of 10
mM dNTPs (dATP, dCTP, dTTP, dGTP), 3 µl of Clone 866390H1 mini-prep DNA
(approx. 0.125 µg/µl), 0.3 µl of Primer LW1324 (1 µg/µl), 0.3 µl of Primer LW1325
(1 µg/µl), and 0.5 µl of High Fidelity Taq polymerase (Boehringer Mannheim). The
PCR reaction was started with 1 cycle of 94°C for 2 minutes, followed by 12 cycles at
94°C for 30 seconds, 55°C for 30 seconds, and 72°C for 1 minute.

The contents from the PCR reaction were loaded onto a 1.2% agarose
gel and electrophoresed. The DNA band of expected size was excised from the gel,
placed in a GenElute Agarose spin column (Supelco) and spun for 10 minutes at
maximum speed in a Savant microcentrifuge. The eluted DNA was
ethanol-precipitated and resuspended in 6 µl H2O for ligation.

The purified PCR fragment containing the CON197 coding sequence 
was ligated into a commercial vector using Invitrogen's Original TA Cloning Kit. The 
resulting plasmid DNA from the culture was isolated using a Concert Rapid Plasmid 
Miniprep System (GibcoBRL) and sequenced to confirm that the plasmid contained 
the CON197 insert. 

Upon confirmation of the insert, the same transformant was used to 
inoculate a 50 ml culture of LB medium. The culture was grown for 16 hours at 
37°C, and centrifuged into a cell pellet. Plasmid DNA was purified from the pellet 
using a Qiagen Plasmid Midi Kit and again sequenced to confirm successful cloning 
of the CON197 insert, using an ABI377 fluorescence-based sequencer (Perkin 
Elmer/Applied Biosystems Division, PE/ABD, Foster City, CA) and the ABI 
PRISM™ Ready Dye-Deoxy Terminator kit with Taq FS™ polymerase as described 
above for CON193 in Example 1A.1. 

G. Cloning of CON202 G Protein-Coupled Receptor 
G.1 Database Search Results 

The database searching identified Clone Number 1305513H1 in the 
Incyte database as an interesting candidate sequence. The 1305513H1 clone was 
obtained and sequenced using an ABI377 fluorescence-based sequencer (Perkin 
Elmer/Applied Biosystems Division, PE/ABD, Foster City, CA) and the ABI 
PRISM™ Ready Dye-Deoxy Terminator kit with Taq FS™ polymerase as described 
above for CON193 in Example 1A.1. 

Sequencing of Incyte Clone 1305513H1 revealed a sequence 
corresponding to nucleotides 1054 to 1378 of SEQ ID NO: 13. Using a FORTRAN 
computer program called "tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-
535 (1994)], Clone 1305513H1 was deduced to contain two transmembrane-spanning 
domains (TMVI and TMVII) and an extracellular loop for a previously unidentified 
GPCR, which was designated as "CON202". The sequence obtained was used as a 
tool to identify a full length GPCR clone as described in the next section. 

G.2 PCR Screening of Genomic Clones 

A human genomic phage library was selected as a source from which 
to attempt to clone the CON202 gene. The genomic library was amplified as 
described above for CON193 in Example 1A.2. 

This genomic library was screened by PCR using the primers: GV599 
(5'GGCAGAAGAAGGCTATTGGTCTTAGACGAG3'; SEQ ID NO: 52), and 
GV600 (5'CTGAAACAGCGCCTCAGCTCCC3'; SEQ ID NO: 53). These primers 
were designed from the sequence of Clone 1305513H1 to amplify a 253 base pair 
fragment (corresponding to nucleotides 1064 to 1317 of SEQ ID NO: 13) from any 
corresponding genomic clone in the library. The 20 µl PCR reactions each contained 
12.8 µl of H2O, 2 µl of 10x PCR buffer II (Perkin-Elmer), 2 µl of 25 mM MgCl2, 0.8 
µl of 10 mM dNTP's (dATP, dGTP, dCTP, dTTP), 0.12 µl of primer GV599 
(1 µg/ml), 0.12 µl of primer GV600 (1 µg/ml), 0.2 µl AmpliTaq Gold polymerase (5 
Units/µl, with "Units" as defined by the supplier, Perkin Elmer) and 2 µl of phage 
from one of the 24 tubes. The PCR reaction consisted of 1 cycle at 95°C for 10 
minutes; then 17 cycles at 95°C for 20 seconds, 72°C for 2 minutes decreasing 1°C 
each cycle, 72°C for 30 seconds followed by 30 cycles at 95°C for 20 seconds, 55°C 
for 30 seconds, and 72°C for 30 seconds. 
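
The stepped-temperature ("touchdown") portion of the program above can be written out cycle by cycle. The following Python sketch is illustrative only. It assumes the first of the 17 cycles anneals at 72°C and each later cycle anneals 1°C lower, and the function name `touchdown_annealing` is hypothetical.

```python
def touchdown_annealing(start_c: int, step_c: int, cycles: int) -> list:
    """Annealing temperature for each cycle, dropping by step_c per cycle."""
    return [start_c - step_c * i for i in range(cycles)]


# 17 cycles starting at 72 degrees C and decreasing 1 degree C each cycle.
temps = touchdown_annealing(72, 1, 17)

if __name__ == "__main__":
    for cycle, temp in enumerate(temps, start=1):
        print(f"cycle {cycle:2d}: anneal at {temp} C")
```

Under these assumptions the last touchdown cycle anneals one degree above the 55°C used in the 30 fixed-temperature cycles that follow.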

The PCR products were visualized on a 2% agarose gel. For those 
tubes which produced the correct sized band of 253 bp, five microliters from each 
original phage culture tube were used to amplify the CON202 genomic phage DNA as 
described above for CON193 in Example 1A.2. 

The genomic DNA from the single phage isolate was sequenced with 
the ABI PRISM™ 310 Genetic Analyzer (PE Applied Biosystems) which uses 
advanced capillary electrophoresis technology and the ABI PRISM™ BigDye™ 
Terminator Cycle Sequencing Ready Reaction Kit. The cycle-sequencing reaction 
contained 20 µl of H2O, 16 µl of BigDye™ Terminator Mix, 1 µl of genomic phage 
DNA (1.1 µg/µl), and 3 µl primer (25 ng/µl). The reaction was performed in a 
Perkin-Elmer 9600 thermocycler at 95°C for 5 minutes, followed by 99 cycles of 95°C 
for 30 seconds, 55°C for 20 seconds and 60°C for 4 minutes. The product was 
purified using a Centriflex™ gel filtration cartridge, dried under a vacuum, then 
dissolved in 16 µl of Template Suppression Reagent. The samples were heated at 
95°C for 5 minutes then placed in the 310 Genetic Analyzer. 

G.3 Subcloning of the Coding Region of CON202 via PCR 

Additional experiments were conducted to subclone the coding region 
of the CON202 clone into a more useful vector. Two PCR primers were designed 
based on the coding region of CON202 for the purpose of PCR amplification of the 
CON202 coding sequence. The first, Primer LW1482 
(5'AGC TATGGCGAACTATAGCCATGCAGC 3'; SEQ ID NO: 54) included the 5' 
end of the CON202 coding sequence (underlined). The second, Primer LW1483 
(5'AGTCC TCATATAACACAGTAAGGTTCC 3'; SEQ ID NO: 55) included the 
sequence complementary to the 3' end of the CON202 coding sequence (underlined). 

The PCR was performed in a 50 µl reaction containing 36.5 µl of H2O, 
5 µl of 10x TT buffer (140 mM Ammonium Sulfate, 0.1% gelatin, 0.6 M Tris-tricine, 
pH 8.4), 5 µl of 15 mM MgSO4, 2 µl of 10 mM dNTP's (dATP, dCTP, dTTP, dGTP), 
0.5 µl of CON202 genomic phage DNA (approx. 1.1 µg/µl), 0.3 µl of Primer LW1482 
(1 µg/µl), 0.3 µl of Primer LW1483 (1 µg/µl), and 0.4 µl of High Fidelity Taq 
polymerase (Boehringer Mannheim). The PCR reaction was started with 1 cycle of 
94°C for 2 minutes, followed by 12 cycles at 94°C for 30 seconds, 55°C for 30 
seconds, and 72°C for 1.3 minutes. 

The contents from the PCR reaction were loaded onto a 2.1% agarose 
gel and electrophoresed. The DNA band of expected size (1.1 kb) was excised from 
the gel, placed on a GenElute Agarose spin column (Supelco), and spun for 10 
minutes at maximum speed in a microfuge. The eluted DNA was ethanol-precipitated 
and resuspended in 6 µl of H2O for ligation. 

The purified PCR fragment, containing the CON202 coding sequence, 
was ligated into a commercial vector using Invitrogen's Original TA Cloning Kit. The 
ligation reaction was carried out as described above for CON193 in Example 1A.3. 
The resulting plasmid DNA from the culture was isolated using a Concert Rapid 
Plasmid Miniprep System (GibcoBRL) and sequenced to confirm that the plasmid 
contained the CON202 insert. The resulting construct was denoted as pCR-CON202. 

The final subclone was sequenced using the ABI PRISM™ 310 
Genetic Analyzer (PE Applied Biosystems) which uses advanced capillary 
electrophoresis technology and the ABI PRISM™ Terminator Cycle Sequencing 
Ready Reaction Kit. The cycle-sequencing reaction contained 6 µl of H2O, 8 µl of 
BigDye™ Terminator mix, 5 µl miniprep DNA (0.1 µg/µl), and 1 µl primer (25 
ng/µl). The reaction was performed in a Perkin-Elmer 9600 thermocycler with 25 
cycles of 96°C for 10 seconds, 50°C for 10 seconds, and 60°C for 4 minutes. The 
product was purified using Centriflex™ gel filtration cartridges, dried under vacuum, 
then dissolved in 16 µl of Template Suppression Reagent. The samples were heated 
to 95°C for 5 minutes then placed in the 310 Genetic Analyzer. 

Upon confirmation of the insert, the same transformant was used to 
inoculate a 50 ml culture of LB medium. The culture was grown for 16 hours at 
37°C, and centrifuged into a cell pellet. Plasmid DNA was purified from the pellet 
using a Qiagen Plasmid Midi Kit and again sequenced to confirm successful cloning 
of the CON202 insert, as described above. 

H. Cloning of CON222 G Protein-Coupled Receptor 
H.1 Database Search Results 

The database searching in the Incyte database identified Sequence 
Number 2488822CB1 as an interesting candidate sequence. This Incyte sequence is a 
consensus sequence derived by compiling multiple, shorter contiguous (apparently 
overlapping) partial sequences from cDNA clones. A single clone known to contain 
the complete consensus sequence was not available from Incyte. The following 
experiments were performed to clone a piece of human DNA which corresponds to 
the region of the theoretical Incyte Sequence Number 2488822CB1 that was deduced 
to encode a heretofore undescribed GPCR. The human DNA and protein that was 
eventually isolated is referred to herein as CON222. 
H.2 Isolation of CON222 Genomic DNA using PCR 

To isolate a clone of CON222, PCR primers were designed based on 
the 5' and 3' ends of the open reading frame that was identified in the Incyte Sequence 
Number 2488822CB1. The first primer, designated as LW1440, has the sequence 
5'AAGCGG ATGTTTAGACCTCTTGTG 3' (SEQ ID NO: 60) which corresponds to 
nucleotides 1 to 18 of SEQ ID NO: 15 (underlined). The second primer, designated 
LW1441, has the sequence 5'AACAG TCATGAATAGGAATTGAG 3' (SEQ ID NO: 
61) which is the reverse complement of nucleotides 1173 to 1191 of SEQ ID NO: 15 
(underlined). 
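
Because the second primer is described as a reverse complement of the coding strand, the corresponding sense-strand sequence can be recovered computationally. The following Python sketch is illustrative only. It assumes the space in the printed primer separates the 5' tail (AACAG) from the gene-specific portion, and the function name `reverse_complement` is hypothetical.

```python
# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string (5' to 3' on the other strand)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Gene-specific portion of Primer LW1441 as printed (assumed boundary at the space).
lw1441_gene_specific = "TCATGAATAGGAATTGAG"
sense_strand = reverse_complement(lw1441_gene_specific)

if __name__ == "__main__":
    print(sense_strand)
```

Under that assumption the recovered sense-strand sequence ends in TGA, a stop codon, as one would expect for a primer placed at the 3' end of an open reading frame.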

PCR was performed in a 50 µl reaction containing 22.1 µl H2O, 10 µl 
Rapid Dye Loading Buffer (Origene), 5 µl 10x TT buffer (140 mM Ammonium 
Sulfate, 0.1% gelatin, 0.6 M Tris-tricine pH 8.4), 5 µl 15 mM MgSO4, 2 µl 10 mM 
dNTP's (dATP, dCTP, dGTP, dTTP), 5 µl human genomic DNA (0.03 µg/µl) 
(Clontech, Cat# 6550-1), 0.3 µl of Primer LW1440 (1 µg/µl) (SEQ ID NO: 59), 0.3 
µl of LW1441 (1 µg/µl) (SEQ ID NO: 60), 0.4 µl High Fidelity Taq polymerase 
(Boehringer Mannheim). The PCR reaction was started with 1 cycle of 94°C for 2 
minutes followed by 10 cycles at 94°C for 30 seconds, 55°C for 2 minutes, 72°C for 2 
minutes then 25 cycles at 94°C for 30 seconds, 55°C for 30 seconds, and 72°C for 2 
minutes. The PCR reaction was loaded onto a 1.2% agarose gel. The resulting band 
was not 1.2 kb in length as expected, indicating that this method was unsuccessful in 
identifying an appropriate clone from the selected Clontech genomic DNA library 
containing the coding region of CON222. 

A human genomic DNA phage library was selected as an alternate 
source from which to attempt to clone the CON222 gene. Internal primers were 
designed to attempt to isolate from a genomic library a single phage which expresses 
the complete coding region. The procedure was carried out as described above for 
CON193 in Example 1A.2. 

PCR was performed to identify a phage that contained a genomic 
DNA insert which corresponds to the deduced complete coding region of Incyte 
Sequence Number 2488822CB1 using the primers: Primer LW1442: 
5'GCCATTCTGTCCACAGAAG3' (SEQ ID NO: 58; see nucleotides 391 to 410 of 
SEQ ID NO: 15) and Primer LW1443: 5'TCAGTTGCTGTTATGGCAC3' (SEQ ID 
NO: 59; see reverse complement of nucleotides 744 to 761 of SEQ ID NO: 15). 
These primers were designed based on the deduced coding region of Incyte Sequence 
Number 2488822CB1, to amplify a 370 bp fragment (corresponding to nucleotides 
391 to 761 of SEQ ID NO: 15) from any corresponding genomic clone in the library. 
The 50 µl PCR reactions each contained 32 µl of H2O, 5 µl of 10x PCR gold buffer 
(PE Applied Biosystems), 5 µl of 25 mM MgCl2, 2 µl of 10 mM dNTP's (dATP, 
dCTP, dGTP, dTTP), 0.3 µl of primer LW1442 (1 µg/ml), 0.3 µl of primer LW1443 
(1 µg/ml), 0.4 µl AmpliTaq Gold polymerase (5 U/µl, with "Units" defined by the 
supplier; PE Applied Biosystems) and 5 µl of phage isolated human genomic DNA 
(0.03 µg/µl). The PCR reaction consisted of 1 cycle at 95°C for 10 minutes, then 17 
cycles at 95°C for 20 seconds and 72°C for 2 minutes decreasing 1 degree each cycle, 
and 72°C for 1 minute, followed by 30 cycles at 95°C for 20 seconds, 55°C for 30 
seconds, and 72°C for 1 minute. An aliquot of the PCR reaction was loaded onto a 
1.2% agarose gel and electrophoresed. Although the internal primers were designed 
to produce a 370 bp PCR fragment, the resulting band was approximately 1.4 kb in 
length. 

The DNA band was excised from the gel, placed on GenElute Agarose 
spin columns (Supelco) and spun for 10 minutes at maximum speed in a 
microcentrifuge. The eluted DNA was ethanol-precipitated and resuspended in 10 µl 
of H2O and 5 µl was used to sequence the PCR band. 

The PCR fragment was sequenced with an ABI PRISM™ 310 Genetic 
Analyzer (PE Applied Biosystems) which uses advanced capillary electrophoresis 
technology and the ABI PRISM™ BigDye™ Terminator Cycle Sequencing Ready 
Reaction Kit. Each cycle-sequencing reaction contained 6 µl of H2O, 8 µl of BigDye 
Terminator mix, 5 µl PCR fragment DNA (0.2 µg/µl), and 1 µl Primer LW1442 
(25 ng/µl) and Primer LW1443 (25 ng/µl). The reaction was performed in a Perkin-
Elmer 9600 thermocycler with 25 cycles of 96°C for 10 seconds, 50°C for 10 seconds, 
and 60°C for 4 minutes. The product was purified using a Centriflex™ gel filtration 
cartridge, dried under vacuum, then dissolved in Template Suppression Reagent (PE 
Applied Biosystems). The samples were heated at 95°C for 5 minutes then placed in 
the 310 Genetic Analyzer. 

The sequence analysis determined that there is an intron in the middle 
of the 5th transmembrane-spanning domain between nucleotides 673 and 674 in SEQ 
ID NO: 15. This intron was responsible for the unexpectedly large PCR fragment. 
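
The size of the intervening sequence can be estimated from the difference between the observed and expected fragment sizes. The following Python sketch is illustrative only. It uses the primer coordinates given above (nucleotides 391 and 761), follows the text's convention of taking the difference of the coordinates as the fragment length, and assumes the observed band is read as roughly 1.4 kb, which is a gel estimate rather than a measured value. The function name `expected_amplicon_bp` is hypothetical.

```python
def expected_amplicon_bp(start_nt: int, end_nt: int) -> int:
    """Fragment length using the text's convention (end coordinate minus start)."""
    return end_nt - start_nt


expected_bp = expected_amplicon_bp(391, 761)

# Approximate size of the band seen on the gel (assumption: about 1.4 kb).
observed_bp = 1400

# Extra length attributed to the intron between nucleotides 673 and 674.
estimated_intron_bp = observed_bp - expected_bp

if __name__ == "__main__":
    print("expected fragment (bp):", expected_bp)
    print("estimated intron (bp):", estimated_intron_bp)
```

The reported intron position, between nucleotides 673 and 674, falls inside the region bounded by the two internal primers, which is why the intron lengthens this particular fragment.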
H.3 Isolation of Full Length cDNA 

Since attempts to isolate an uninterrupted coding region from genomic DNA 
were unsuccessful, a fetal brain cDNA was used to generate the complete coding 
region of Incyte Sequence Number 2488822CB1. The PCR primers described above, 
LW1440 (SEQ ID NO: 60) and LW1441 (SEQ ID NO: 61), which correspond to the 
5' and 3' ends of CON222 respectively, were used to generate the full length coding 
region. 

The 50 µl PCR reaction contained 37.4 µl of H2O, 5 µl of 10x cDNA 
PCR buffer (Clontech), 1 µl of 10 mM dNTP's (dATP, dCTP, dTTP, dGTP), 5 µl of 
Marathon-Ready Fetal Brain cDNA (Clontech), 0.3 µl of Primer LW1440 (1 µg/µl), 
0.3 µl of Primer LW1441 (1 µg/µl), and 1 µl of 50x Advantage cDNA polymerase 
(Clontech). The PCR reaction was started with 1 cycle of 94°C for 1 minute, 
followed by 30 cycles at 94°C for 30 seconds, 50°C for 30 seconds, and 68°C for 3 
minutes. 

The contents from the PCR reaction were loaded onto a 1.2% agarose 
gel and electrophoresed. The DNA band of expected size (1.2 kb) was excised from 
the gel, placed on a GenElute Agarose spin column (Supelco), and spun for 10 
minutes at maximum speed in a microfuge. The eluted DNA was ethanol-precipitated 
and resuspended in 6 µl H2O for ligation. 
H.4 Subcloning of Coding Region of CON222 via PCR 
After a cDNA containing the full length CON222 open reading frame 
was obtained, the coding region of CON222 was then subcloned into a more useful 
vector as follows. 

The purified PCR fragment described above, containing the CON222 
coding sequence, was ligated into a commercial vector using Invitrogen's Original TA 
Cloning Kit. The ligation reaction was carried out as described above for CON193 in 
Example 1A.3. The resulting plasmid DNA from the culture was isolated using a 
Concert Rapid Plasmid Miniprep System (GibcoBRL) and sequenced to confirm that 
the plasmid contained the CON222 insert. 

The subcloned insert in pCR2.1 was sequenced using the ABI 
PRISM™ 310 Genetic Analyzer (PE Applied Biosystems) which uses advanced 
capillary technology and the ABI PRISM™ BigDye™ Terminator Cycle Sequencing 
Ready Reaction Kit. Each cycle-sequence reaction contained 6 µl of H2O, 8 µl of 
BigDye™ Terminator mix, 5 µl mini-prep DNA (0.1 µg/µl), and 1 µl of primer (25 
ng/µl) and was performed in a Perkin-Elmer 9600 thermocycler with 25 cycles of 
96°C for 10 seconds, 50°C for 10 seconds, and 60°C for 4 minutes. The product was 
purified using a Centriflex™ gel filtration cartridge, vacuum dried and dissolved in 16 
µl of Template Suppression Reagent (PE Applied Biosystems). The samples were 
heated at 95°C for 5 minutes then placed in the 310 Genetic Analyzer. 

Upon confirmation of the insert, the same transformant was used to 
inoculate a 50 ml culture of LB medium. The culture was grown for 16 hours at 
37°C, and centrifuged into a cell pellet. Plasmid DNA was purified from the pellet 
using a Qiagen Plasmid Midi Kit and again sequenced to confirm successful cloning 
of the CON222 insert, as described above. 

I. Cloning of CON215 G Protein-Coupled Receptor 
I.1 Database Search Results 

The database searching identified Clone 1452259H1 in the Incyte 
database as an interesting candidate sequence. The sequence from 1452259H1 clone 
was used to search the Incyte full-length database and matched the entry 1650519CB1. 
An inspection of the clones that made up 1650519CB1 indicated that Incyte Clone 
2796157H1 probably contained the full-length coding region. Sequence analysis of 
Incyte Clone 2796157H1 indicated that it contains the entire coding region for a 
previously unidentified GPCR, which was designated "CON215", along with 12 
nucleotides of 5' untranslated region, 63 nucleotides of 3' untranslated region and a 
poly A tail. The DNA and deduced amino acid sequences for CON215 are set forth 
in SEQ ID NOS: 17 and 18, respectively. A database search with this CON215 
sequence showed a 47% match to the human probable G protein-coupled receptor 
KIAA0001. 

Since the untranslated regions were relatively short, it was not 
necessary to remove the coding region of CON215 from the pINCY vector (Incyte) 
and the construct is referred to as pINCY-CON215. The Incyte Clone 2796157H1 
was sequenced using the ABI PRISM™ 310 Genetic Analyzer (PE Applied 
Biosystems) which uses advanced capillary electrophoresis technology and the ABI 
PRISM™ BigDye™ Terminator Cycle Sequencing Ready Reaction Kit as described 
above for CON222 in Example 1H.4. 

J. Cloning of CON217 G Protein-Coupled Receptor 
J.1 Database Search Results 

The Incyte database search identified EST 3700658H1 as an interesting 
candidate sequence. The EST sequence No. 3700658H1 was used to search the Incyte 
full length database. This search identified Incyte clone No. 3356166H1 as a clone 
that potentially contained a full length GPCR corresponding to the selected EST. 

The 3356166H1 clone was obtained from Incyte and sequenced using 
an ABI377 fluorescence-based sequencer and the ABI PRISM™ Ready Dye-Deoxy 
Terminator kit with Taq FS™ polymerase as described above for CON193 in 
Example 1A.1. 

Sequencing of Incyte Clone No. 3356166H1 revealed a 2480 basepair 
sequence as shown in SEQ ID NO: 19. Using a FORTRAN computer program called 
"tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-535 (1994)], Clone No. 
3356166H1 was deduced to contain seven transmembrane-spanning domains (TMI-
TMVII) and was designated as "CON217" (SEQ ID NO: 20). The following 
experiments were performed to subclone and isolate the full length coding sequence 
of CON217 from Incyte Clone No. 3356166H1. 

J.2 Subcloning of the Coding Region of GPCR217 

To subclone the full length coding sequence of CON217, PCR primers 
were designed based on the 5' and 3' ends of the open reading frame that was 
identified in the Incyte Clone No. 3356166H1. The first primer, designated as 
LW1448, has the sequence 5'AAGCGGTACC ATGTTAGCCAACAGCTCCTC 3' 
(SEQ ID NO: 66) which corresponds to nucleotides 42 to 62 of SEQ ID NO: 19 
(underlined). The second primer, designated LW1449, has the sequence 
5'AAGCTCTAGA TCAGAGGGCGGAATCCTGG 3' (SEQ ID NO: 67) which is the 
reverse complement of nucleotides 1142 to 1160 of SEQ ID NO: 20 (underlined). 
The primers also include recognition sequences (bold) for the restriction enzymes 
KpnI and XbaI, respectively. 

PCR was performed in a 50 µl reaction containing 32.5 µl of H2O, 5 
µl of 10x Pfx Amplification buffer (GibcoBRL), 5 µl of 10x PCR Enhancer solution 
(GibcoBRL), 1.5 µl of 50 mM MgSO4, 2 µl of 10 mM dNTP's (dATP, dCTP, dGTP, 
dTTP), 3 µl 3356166H1 mini-prep DNA (0.125 µg/µl obtained with the Concert 
Rapid Plasmid Miniprep System; GibcoBRL), 0.3 µl of Primer LW1448 (1 µg/µl) 
(SEQ ID NO: 3), 0.3 µl of Primer LW1449 (1 µg/µl) (SEQ ID NO: 4), 0.5 µl 
Platinum Pfx DNA polymerase (2.5 U/µl; GibcoBRL). The PCR reaction was started 
with 1 cycle of 94°C for 2 minutes followed by 25 cycles at 94°C for 30 seconds, 55°C 
for 30 seconds, 68°C for 1.3 minutes. 

The contents from the PCR reaction were loaded onto a 1.2% agarose 
gel and electrophoresed. The DNA band of expected size (~1.1 kb) was excised from 
the gel, placed on a GenElute Agarose spin column (Supelco), and spun for 10 
minutes at maximum speed in a microfuge. The eluted DNA was ethanol-precipitated 
and resuspended in 6 µl of H2O for ligation. 

The purified PCR fragment, containing the CON217 coding sequence, 
was ligated into a commercial vector designated pCR2.1 using Invitrogen's Original 
TA Cloning Kit. The ligation reaction was carried out as described above for 
CON193 in Example 1A.3. The resulting plasmid DNA from the culture was isolated 
using a Concert Rapid Plasmid Miniprep System (GibcoBRL) and sequenced to 
confirm that the plasmid contained the CON217 insert and to confirm that no errors 
were introduced during PCR amplification. The resulting construct was denoted as 
pCR-CON217. 

The final subclone was sequenced using the ABI PRISM™ 310 
Genetic Analyzer (PE Applied Biosystems) which uses advanced capillary 
electrophoresis technology and the ABI PRISM™ Terminator Cycle Sequencing 
Ready Reaction Kit as described above for CON222 in Example 1H.4. 

EXAMPLE 2 
Analysis of G Protein-Coupled Receptor Sequence 
A. CON193 

The DNA and deduced amino acid sequence for CON193 are 
set forth in SEQ ID NOS: 1 and 2, respectively. Beginning with the initiation codon 
(methionine), the CON193 genomic clone contains an open reading frame of 963 
nucleotides encoding 321 amino acids, followed by a stop codon. Using a FORTRAN 
computer program called "tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-
535 (1994)], CON193 was shown to contain seven transmembrane-spanning domains 
corresponding to residues 30-49 (1TM), 61-81 (2TM), 103-122 (3TM), 146-165 
(4TM), 199-222 (5TM), 243-262 (6TM), and 270-295 (7TM) of SEQ ID NO: 2. 
These transmembrane domains define first ("N-terminal," residues 1-29), second 
("first EC loop," residues 82-102), third ("second EC loop," residues 166-198), and 
fourth ("third EC loop," residues 263-269) extracellular domains, as well as first 
("first IC loop," residues 50-60), second ("second IC loop," residues 123-145), third 
("third IC loop," residues 223-242), and fourth ("C-terminal," residues 296-321) 
intracellular domains. 
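
The extracellular and intracellular domains listed above follow directly from the transmembrane boundaries and the protein length. The following Python sketch is illustrative only and does not reproduce the "tmtrest.all" program. It assumes the N-terminus is extracellular, so that the regions between transmembrane segments alternate EC, IC, EC and so on, and the function name `derive_domains` is hypothetical.

```python
# Transmembrane segments of CON193 (residues of SEQ ID NO: 2), as listed in the text.
CON193_TM = [(30, 49), (61, 81), (103, 122), (146, 165),
             (199, 222), (243, 262), (270, 295)]
CON193_LENGTH = 321  # amino acids


def derive_domains(tm_segments, length):
    """Return [(side, (start, end)), ...] for every non-transmembrane region.

    Assumes an extracellular N-terminus, so regions alternate EC, IC, EC, ...
    """
    regions = []
    prev_end = 0
    for start, end in tm_segments:
        regions.append((prev_end + 1, start - 1))  # gap before this TM segment
        prev_end = end
    regions.append((prev_end + 1, length))  # C-terminal tail
    sides = ["EC" if i % 2 == 0 else "IC" for i in range(len(regions))]
    return list(zip(sides, regions))


if __name__ == "__main__":
    for side, (start, end) in derive_domains(CON193_TM, CON193_LENGTH):
        print(f"{side}: residues {start}-{end}")
```

Applied to the CON193 boundaries, the sketch returns the same eight regions recited in the text, from the N-terminal residues 1-29 through the C-terminal residues 296-321.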

Inspection of the CON193 amino acid sequence (SEQ ID NO: 2) 
reveals that this GPCR contains a DRY sequence following the third transmembrane 
domain (3TM) and a PIVY sequence found in the sixth transmembrane domain 
(TM6). In addition, the CON193 polynucleotide sequence was compared to 
sequences of known genes. CON193 is 45% identical and 72% similar to the mouse 
olfactory receptor gene S19 [see Malnic et al., Cell 96:713-723 (1999)]. This level of 
sequence similarity suggests that CON193 is a novel GPCR. 

The CON193 cDNA clone (SEQ ID NO: 1) was deposited with the 
National Center for Agricultural Utilization Research at the United States Department 
of Agriculture, 1815 North University Street, Peoria, Illinois 61604 in accordance with 
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30250. 

B. CON166 

The DNA and deduced amino acid sequence for CON166 are set forth 
in SEQ ID NOS: 3 and 4, respectively. Beginning with the initiation codon 
(methionine), the CON166 genomic clone contains an open reading frame of 1,011 
nucleotides encoding 337 amino acids, followed by a stop codon. Using a 
FORTRAN computer program called "tmtrest.all" [Parodi et al., Comput. Appl. 
Biosci., 5: 527-535 (1994)], CON166 was shown to contain seven transmembrane-
spanning domains corresponding to the following residues presented in SEQ ID NO: 
4: 1TM (30-49), 2TM (59-79), 3TM (99-119), 4TM (141-161), 5TM (191-215), 6TM 
(231-251), and 7TM (277-296). These transmembrane domains define first ("N-
terminal," residues 1-29), second ("first EC loop," residues 80-98), third ("second EC 
loop," residues 162-190), and fourth ("third EC loop," residues 252-276) 
extracellular domains, as well as first ("first IC loop," residues 50-58), second 
("second IC loop," residues 120-140), third ("third IC loop," residues 216-230), and 
fourth ("C-terminal," residues 297-337) intracellular domains. 

Inspection of the CON166 amino acid sequence (SEQ ID NO: 2) 
reveals that this GPCR contains an FRC sequence following the third transmembrane 
domain (3TM), which is typically occupied by a consensus DRY sequence in other 
GPCRs; a PLLY sequence is also found in the seventh transmembrane domain (7TM). 
In addition, the CON166 polynucleotide sequence was compared to sequences of 
known genes. CON166 is 44% identical and 62% similar to a T-cell-specific G 
protein-coupled receptor of Gallus gallus found in the TREMBL database (Accession 
No. L06109). This level of sequence similarity suggests that CON166 is a novel 
GPCR. 

The CON166 cDNA clone (SEQ ID NO: 3) was deposited with the 
National Center for Agricultural Utilization Research at the United States Department 
of Agriculture, 1815 North University Street, Peoria, Illinois 61604 in accordance with 
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30248. 

C. CON103 

The DNA and deduced amino acid sequence for CON103 are set forth 
in SEQ ID NOS: 5 and 6, respectively. Beginning with the initiation codon 
(methionine), the CON103 genomic clone contains an open reading frame of 1,152 
nucleotides encoding 384 amino acids, followed by a stop codon and a short open 
reading frame (SEQ ID NO: 5). Using a FORTRAN computer program called 
"tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-535 (1994)], CON103 was 
shown to contain seven transmembrane-spanning domains corresponding to the 
following residues in SEQ ID NO: 6: 54-77 (1TM), 89-108 (2TM), 134-149 (3TM), 
167-188 (4TM), 216-240 (5TM), 258-283 (6TM), and 301-320 (7TM). These 
transmembrane domains define first ("N-terminal," residues 1-53), second ("first EC 
loop," residues 109-133), third ("second EC loop," residues 189-215), and fourth 
("third EC loop," residues 284-300) extracellular domains, as well as first ("first IC 
loop," residues 78-88), second ("second IC loop," residues 150-166), third ("third IC 
loop," residues 241-257), and fourth ("C-terminal," residues 321-384) intracellular 
domains. 

Inspection of the CON103 amino acid sequence (SEQ ID NO: 6) 
reveals that this GPCR contains an NRY sequence following the third transmembrane 
domain (3TM), which is typically occupied by a consensus DRY sequence in other 
GPCRs. In addition, the CON103 polynucleotide sequence was compared to 
sequences of known genes. CON103 is 36% identical to GPR31 (GenBank Accession 
No. U65402) and 31% identical to the P2Y1 purinergic receptor (GenBank Accession 
No. S81950). This level of sequence similarity indicates that CON103 is a novel 
GPCR. 

The CON103 cDNA clone (SEQ ID NO: 5) was deposited with the 
National Center for Agricultural Utilization Research at the United States Department 
of Agriculture, 1815 North University Street, Peoria, Illinois 61604 in accordance with 
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30247. 

D. CON203 

The DNA and deduced amino acid sequence for CON203 are set forth 
in SEQ ID NOS: 7 and 8, respectively. Beginning with the initiation codon 
(methionine), the CON203 genomic clone contains an open reading frame of 999 
nucleotides encoding 333 amino acids, followed by a stop codon. Using a FORTRAN 
computer program called "tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-
535 (1994)], CON203 was shown to contain seven transmembrane-spanning domains 
corresponding to the following residues of SEQ ID NO: 7: nucleotides 29-53 (1TM), 
63-82 (2TM), 97-118 (3TM), 136-160 (4TM), 189-211 (5TM), 232-252 (6TM), and 
281-300 (7TM). These transmembrane domains define first ("N-terminal," residues 
1-28), second ("first EC loop," residues 83-96), third ("second EC loop," residues 
161-188), and fourth ("third EC loop," residues 253-280) extracellular domains, as 
well as first ("first IC loop," residues 54-62), second ("second IC loop," residues 119-
135), third ("third IC loop," residues 212-231), and fourth ("C-terminal," residues 
301-333) intracellular domains. 

Inspection of the CON203 amino acid sequence (SEQ ID NO: 8) 
reveals that this GPCR contains a DRF sequence following the third transmembrane 
domain (3TM), which is typically occupied by a consensus DRY sequence in other 
GPCRs; CON203 also exhibited a PLIY sequence in the seventh transmembrane 
domain (7TM). In addition, the CON203 polynucleotide sequence was compared to 
sequences of known genes. CON203 is 33% identical to a platelet activating receptor 
(GenBank Accession No. AF002986). This level of sequence similarity suggests that 
CON203 is a novel GPCR. 

The CON203 cDNA clone (SEQ ID NO: 7) was deposited with the 
National Center for Agricultural Utilization Research at the United States Department 
of Agriculture, 1815 North University Street, Peoria, Illinois 61604 in accordance with 
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30254. 

E. CON198 

The DNA and deduced amino acid sequences for CON198 are set forth
in SEQ ID NO: 9 and 10, respectively. Beginning with the initiator methionine, the
CON198 genomic clone contains an open reading frame of 954 nucleotides encoding
318 amino acids, followed by a stop codon. It will be appreciated that residue 2 of
SEQ ID NO: 10 also is a methionine. Amino-terminal sequencing of purified native
or recombinant CON198 protein will provide an indication as to which methionine
acts as an initiator codon in vivo. Using a FORTRAN computer program called
"tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-535 (1994)], CON198 was
deduced to contain seven transmembrane-spanning domains corresponding to residues
28-52 (TM1), 61-80 (TM2), 104-123 (TM3), 147-167 (TM4), 200-226 (TM5), 239-263
(TM6), and 274-295 (TM7) of SEQ ID NO: 10. These transmembrane domains
define first ("N-terminal," residues 1-27 or 2-27), second ("first EC loop," residues
81-103), third ("second EC loop," residues 168-199), and fourth ("third EC loop,"
residues 264-273) extracellular domains as well as first ("first IC loop," residues
53-60), second ("second IC loop," residues 124-146), third ("third IC loop," residues
227-238), and fourth ("C-terminal," residues 296-318) intracellular domains.
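
The extracellular and intracellular domains listed above follow directly from the
transmembrane boundaries. The following is a minimal sketch, not part of the original
disclosure, that derives the flanking domains for CON198 from the seven TM segments
and the 318-residue protein length; the function name and label list are illustrative
assumptions.

```python
# Illustrative sketch: derive the N-terminal, loop and C-terminal regions that
# lie between transmembrane (TM) segments. Coordinates are 1-based, inclusive.
# The function name and labels are hypothetical, chosen for this example only.

DOMAIN_LABELS = [
    "N-terminal", "first IC loop", "first EC loop", "second IC loop",
    "second EC loop", "third IC loop", "third EC loop", "C-terminal",
]

def flanking_domains(tm_segments, protein_length):
    """Return (label, start, end) for each region outside the TM segments."""
    regions = []
    previous_end = 0
    for start, end in tm_segments:
        regions.append((previous_end + 1, start - 1))
        previous_end = end
    regions.append((previous_end + 1, protein_length))
    return [(label, s, e) for label, (s, e) in zip(DOMAIN_LABELS, regions)]

# TM boundaries for CON198 as stated in the text (SEQ ID NO: 10).
CON198_TM = [(28, 52), (61, 80), (104, 123), (147, 167),
             (200, 226), (239, 263), (274, 295)]

for label, start, end in flanking_domains(CON198_TM, 318):
    print(f"{label}: residues {start}-{end}")
```

The same function can be applied to the TM boundaries given for the other clones to
check that the stated loop coordinates are contiguous with the TM segments.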

CON198 contains a DRY sequence following the third transmembrane
domain (TM3), a feature that is conserved in most GPCRs. The most similar sequence
in a public database, at the time of initial screening, was that of rat GPCR RA1c,
which shared only 61% identity at the amino acid level.

The CON198 cDNA clone (SEQ ID NO: 9) was deposited with the
National Center for Agricultural Utilization Research at the United States Department
of Agriculture, 1815 North University Street, Peoria, Illinois 61604, in accordance with
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30252.

F. CON197 

The DNA and deduced amino acid sequences for CON197 are set forth
in SEQ ID NO: 11 and 12, respectively. Beginning with the initiator methionine, the
CON197 genomic clone contains an open reading frame of 921 nucleotides encoding
307 amino acids, followed by a stop codon. Using a FORTRAN computer program
called "tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-535 (1994)], CON197
was deduced to contain seven transmembrane-spanning domains corresponding to
residues 23-47 (TM1), 58-78 (TM2), 99-120 (TM3), 142-164 (TM4), 195-219 (TM5),
237-258 (TM6), and 270-289 (TM7) of SEQ ID NO: 12. These transmembrane
domains define first ("N-terminal," residues 1-22), second ("first EC loop," residues
79-98), third ("second EC loop," residues 165-194), and fourth ("third EC loop," residues
259-269) extracellular domains as well as first ("first IC loop," residues 48-57), second
("second IC loop," residues 121-141), third ("third IC loop," residues 220-236), and
fourth ("C-terminal," residues 290-309) intracellular domains.

CON197 contains a DRY sequence following the third transmembrane
domain (TM3), a feature that is conserved in most GPCRs. The most similar sequence
in a public database, at the time of initial screening, was that of an olfactory receptor,
which shared only 42% identity at the amino acid level.

The CON197 cDNA clone (SEQ ID NO: 11) was deposited with the
National Center for Agricultural Utilization Research at the United States Department
of Agriculture, 1815 North University Street, Peoria, Illinois 61604, in accordance with
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30251.

G. CON202

The DNA and deduced amino acid sequences for this phage insert,
termed "CON202," are set forth in SEQ ID NO: 13 and 14, respectively. The
CON202 open reading frame, as depicted in SEQ ID NO: 14, begins with the initiator
methionine and spans 1110 nucleotides which encode 370 amino acids, followed by a
stop codon. Since this gene was isolated from genomic DNA and there are no
apparent interruptions in the sequence, it is likely that CON202 contains no introns
within the coding region. The full length clone of CON202 contained seven
transmembrane-spanning domains corresponding to residues 24 to 46 (TM1), 57 to
77 (TM2), 96 to 117 (TM3), 135 to 159 (TM4), 184 to 202 (TM5),
286 to 308 (TM6), and 316 to 339 (TM7) of SEQ ID NO: 14. TM2 terminates with PFVC
instead of the characteristic PXXY. TM3 is followed by the sequence TRY instead of
the characteristic DRY. These transmembrane domains define first ("N-terminal,"
residues 1-23), second ("first EC loop," residues 78-95), third ("second EC loop,"
residues 160-183), and fourth ("third EC loop," residues 309-315) extracellular
domains as well as first ("first IC loop," residues 47-56), second ("second IC loop,"
residues 118-134), third ("third IC loop," residues 203-285), and fourth ("C-terminal,"
residues 340-370) intracellular domains.

The CON202 cDNA clone (SEQ ID NO: 13) was deposited with the 
National Center for Agricultural Utilization Research at the United States Department 
of Agriculture 1815 North University Street, Peoria, Illinois 61604 in accordance with 
the Budapest Treaty on January 18, 2000. The clone was given accession no. B- 
30253. 

H. CON222 

Sequencing of the CON222 coding region yielded the DNA and deduced amino
acid sequences set forth in SEQ ID NO: 15 and 16, respectively. The open reading
frame that is depicted in SEQ ID NO: 16 begins with an initiator codon and spans
1188 nucleotides which encode 396 amino acids, followed by a stop codon.

The full length clone of CON222 contains seven transmembrane-spanning
domains corresponding to residues 42-65 (TM1), 79-103 (TM2), 125-156
(TM3), 167-188 (TM4), 217-241 (TM5), 268-290 (TM6), and 301-320 (TM7) of SEQ ID
NO: 16. TM2 is followed by a FRC sequence and TM7 contains a PILY sequence
within. These transmembrane domains define first ("N-terminal," residues 1-41),
second ("first EC loop," residues 104-124), third ("second EC loop," residues
189-216), and fourth ("third EC loop," residues 291-300) extracellular domains as well as
first ("first IC loop," residues 66-78), second ("second IC loop," residues 157-166),
third ("third IC loop," residues 242-267), and fourth ("C-terminal," residues 321-396)
intracellular domains. A search of the public database indicated that CON222 is
about 35% identical to a unique GPCR found in the nervous system of Lymnaea
stagnalis.

The CON222 cDNA clone (SEQ ID NO: 15) was deposited with the
National Center for Agricultural Utilization Research at the United States Department
of Agriculture, 1815 North University Street, Peoria, Illinois 61604, in accordance with
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30257.

I. CON215 

The DNA and deduced amino acid sequences for CON215 are set forth
in SEQ ID NO: 17 and 18, respectively. Beginning with the initiator methionine, the
CON215 genomic clone contains an open reading frame of 1074 nucleotides encoding
358 amino acids, followed by a stop codon. Using a FORTRAN computer program
called "tmtrest.all" [Parodi et al., Comput. Appl. Biosci., 5: 527-535 (1994)],
CON215 was deduced to contain seven transmembrane-spanning domains
corresponding to residues 42-66 (TM1), 81-99 (TM2), 116-137 (TM3), 156-180
(TM4), 210-234 (TM5), 256-275 (TM6), and 308-328 (TM7) of SEQ ID NO: 18.
These transmembrane domains define first ("N-terminal," residues 1-41), second
("first EC loop," residues 100-115), third ("second EC loop," residues 181-209), and
fourth ("third EC loop," residues 276-307) extracellular domains as well as first ("first
IC loop," residues 67-80), second ("second IC loop," residues 138-155), third ("third
IC loop," residues 235-255), and fourth ("C-terminal," residues 329-358) intracellular
domains.

CON215 contains a DRY sequence following the third transmembrane
domain (TM3), a feature that is conserved in most GPCR. CON215 also contains a 
PlI Y sequence within the seventh transmembrane domain (TM7), 

The CON215 cDNA clone (SEQ ID NO: 17) was deposited with the
National Center for Agricultural Utilization Research at the United States Department
of Agriculture, 1815 North University Street, Peoria, Illinois 61604, in accordance with
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30255.

J. CON217

The DNA and deduced amino acid sequences of CON217 are set forth
as SEQ ID NO: 19 and 20, respectively. The open reading frame that is depicted in
SEQ ID NO: 20 begins with an initiator methionine codon and spans 1116 nucleotides
which encode 372 amino acids, followed by a stop codon. In addition, the nucleotide
sequence includes 41 bp in the 5' untranslated region and 1323 bp in the 3'
untranslated region.
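
Each open reading frame described above is stated as a nucleotide count (excluding
the stop codon) together with the number of encoded amino acids. As a simple
consistency check, and purely as an illustrative sketch with hypothetical names,
the stated pairs can be verified by dividing each nucleotide count by three:

```python
# Illustrative sketch: check that each stated ORF length (nucleotides, stop
# codon excluded) equals three times the stated number of amino acids.
# The dictionary and function names are hypothetical.

STATED_ORFS = {
    "CON198": (954, 318),
    "CON197": (921, 307),
    "CON202": (1110, 370),
    "CON222": (1188, 396),
    "CON215": (1074, 358),
    "CON217": (1116, 372),
}

def codons_in_orf(orf_nucleotides):
    """Return the number of codons in an ORF; the length must be a multiple of 3."""
    if orf_nucleotides % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_nucleotides // 3

def check_orfs(stated):
    """Map each clone name to True if its nucleotide and residue counts agree."""
    return {name: codons_in_orf(nt) == aa for name, (nt, aa) in stated.items()}

for name, consistent in check_orfs(STATED_ORFS).items():
    print(name, "consistent" if consistent else "inconsistent")
```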

The full length clone of CON217 contains seven transmembrane-spanning
domains, as indicated by the FORTRAN computer program "tmtrest.all"
[Parodi et al., Comput. Appl. Biosci., 5: 527-535 (1994)], which correspond to residues 29-50
(TM1), 57-75 (TM2), 96-117 (TM3), 137-160 (TM4), 188-210 (TM5), 235-258
(TM6), and 277-297 (TM7). TM3 is followed by a DRY sequence and TM7 contains a
PLVY sequence within. These transmembrane domains define first ("N-terminal,"
residues 1-28), second ("first EC loop," residues 76-95), third ("second EC loop,"
residues 161-187), and fourth ("third EC loop," residues 259-276) extracellular
domains as well as first ("first IC loop," residues 51-56), second ("second IC loop,"
residues 118-136), third ("third IC loop," residues 211-234), and fourth ("C-terminal,"
residues 298-372) intracellular domains. A search of the public database indicated
that CON217 is about 41% identical to GPR23 (GenBank Accession No.: U66578)
and to a purinergic receptor P2Y9 (GenBank Accession No.: U90322).

The CON217 cDNA clone (SEQ ID NO: 19) was deposited with the
National Center for Agricultural Utilization Research at the United States Department
of Agriculture, 1815 North University Street, Peoria, Illinois 61604, in accordance with
the Budapest Treaty on January 18, 2000. The clone was given accession no. B-30256.

K. Summary of Deposits

The polynucleotides (SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15 and 17)
encoding the GPCR polypeptides of the invention were deposited with the
Agricultural Research Service Culture Collection (NRRL) at the National Center for
Agricultural Utilization Research at the U.S. Department of Agriculture, 1815
North University Street, Peoria, Illinois 61604. These deposits were made in
accordance with the Budapest Treaty on the International Recognition of the Deposit
of Microorganisms for the Purposes of Patent Procedure. The table below lists the
details of these deposits.



GPCR       SEQ ID NO:    NRRL No.    Deposit Date
CON193     1             B-30250     1/18/00
CON166     3             B-30248     1/18/00
CON103     5             B-30247     1/18/00
CON203     7             B-30254     1/18/00
CON198     9             B-30252     1/18/00
CON197     11            B-30251     1/18/00
CON202     13            B-30253     1/18/00
CON222     15            B-30257     1/18/00
CON215     17            B-30255     1/18/00
CON217     19            B-30256     1/18/00

EXAMPLE 3 

Hybridization Analysis Demonstrates that the GPCRs are 
Expressed in the Brain 

The expression of GPCR polynucleotides in mammals, such as the rat,
was investigated by in situ hybridization histochemistry. Coronal and sagittal rat
brain cryosections (20 µm thick) were prepared using a Reichert-Jung cryostat.
Individual sections were thaw-mounted onto silanized, nuclease-free slides (CEL
Associates, Inc., Houston, TX), and stored at -80°C. Sections were processed starting
with post-fixation in cold 4% paraformaldehyde, rinsed in cold phosphate-buffered
saline (PBS), acetylated using acetic anhydride in triethanolamine buffer, and
dehydrated through a series of alcohol washes in 70%, 95%, and 100% alcohol at
room temperature. Subsequently, sections were delipidated in chloroform, followed
by rehydration through successive exposure to 100% and 95% alcohol at room
temperature. Microscope slides containing processed cryosections were allowed to air
dry prior to hybridization.

A. CON193

A CON193-specific probe was generated using PCR. The probe
consisted of a 270 bp fragment containing sequence at the 3' end of CON193. The
primers for PCR amplification were LW1248
(5'-GCATGAATTCCAATATACTTCCCCATACCTAC-3'; SEQ ID NO: 26) and LW1249
(5'-GCATGGATCCGGAAAAGAAGGAGAAGAAAG-3'; SEQ ID NO: 27),
which introduced terminal EcoRI and BamHI restriction sites into the PCR product.
Following PCR amplification, the fragment was digested with EcoRI and BamHI and
cloned into pBluescript II cleaved with the same enzymes. For production of a probe
specific for the sense strand of CON193, the CON193 clone in pBluescript II was
linearized with BamHI, which provided a substrate for labeled run-off transcripts (i.e.,
cRNA riboprobes) using the vector-borne T7 promoter and commercially available T7
RNA polymerase. A probe specific for the antisense strand of CON193 was also
readily prepared using the CON193 clone in pBluescript II by cleaving the
recombinant plasmid with EcoRI to generate a linearized substrate for the production
of labeled run-off cRNA transcripts using the T3 promoter and cognate polymerase.
The riboprobes were labeled with p^S]-UTP to yield a specific activity of 0.81 x 10^' 
cpm/pmol for ajitisense riboprobes and 0.55 x 10^ cpm/pmol for sense-strand 

30 riboprobes. Both riboprobes were subsequently denatured by incubating at 70**C for 3 
minutes and added (2 pniol/ml) to hybridization buffer which contained 50% 
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fomiamide, 10% dextran, 0.3 M NnCl, 10 mM Tris (pH 8.0), 1 mM EDTA, IX 
Denhardt*s Solution, and 10 mM dilhiothreitol. Microscope slides containing 
sequential brain cryoseclions were independently exposed to 45 |il of hybridization 
solution per slide and silanized cover slips were placed over the sections being 
5 exposed to hybridization solution. Sections were incubated overnight (15-18 hours) at 

52°C to allow hybridization to occur. Equivalent series of cryosections were exposed 
to sense or antisense CON193-specific cRNA riboprobes. 

Following the hybridization period, coverslips were washed off the
slides in 1X SSC. Slides were subjected to RNase A treatment by incubation in a
buffer containing 20 µg/ml RNase A, 10 mM Tris (pH 8.0), 0.5 M NaCl and 1 mM
EDTA for 45 minutes at 37°C. The cryosections were then subjected to three
high-stringency washes in 0.1X SSC at 52°C for 20 minutes each. Following the series of
washes, cryosections were dehydrated by consecutive exposure to 70%, 95%, and
100% ammonium acetate in alcohol, followed by air drying and exposure to Kodak
BioMax MR-1 film. After 13 days of exposure, the film was developed. Based on
these results, brain sections that gave rise to positive hybridization signals were coated
with Kodak NTB-2 nuclear track emulsion and the slides were stored in the dark for
32 days. The slides were then developed and counterstained with hematoxylin.
Emulsion-coated sections were analyzed microscopically to determine the specificity
of labeling. The signal was determined to be specific if autoradiographic grains
(generated by antisense probe hybridization) were clearly associated with crystal
violet-stained cell bodies. Autoradiographic grains found between cell bodies
indicate non-specific binding.
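
The washes above are specified as fold dilutions of SSC (1X and 0.1X). The sketch
below converts those fold strengths into salt concentrations. It assumes the
conventional 20X SSC stock of 3 M NaCl and 0.3 M sodium citrate, a composition that
is not stated in this document; the function names are likewise illustrative.

```python
# Illustrative sketch: salt concentrations of diluted SSC wash buffers.
# ASSUMPTION (not stated in the text): a conventional 20X SSC stock contains
# 3 M NaCl and 0.3 M sodium citrate.

STOCK_FOLD = 20.0
STOCK_NACL_M = 3.0
STOCK_CITRATE_M = 0.3

def ssc_composition(fold):
    """Return (NaCl, sodium citrate) molarity for SSC at the given fold strength."""
    scale = fold / STOCK_FOLD
    return STOCK_NACL_M * scale, STOCK_CITRATE_M * scale

def stock_volume_ml(fold, final_volume_ml):
    """Volume of 20X stock needed to prepare final_volume_ml at the given strength."""
    return final_volume_ml * fold / STOCK_FOLD

for fold in (1.0, 0.1):
    nacl, citrate = ssc_composition(fold)
    print(f"{fold}X SSC: {nacl * 1000:.1f} mM NaCl, "
          f"{citrate * 1000:.2f} mM sodium citrate")
```

Under that assumption, the 0.1X SSC wash has one tenth the salt of the 1X SSC rinse,
which is what makes it the higher-stringency step at a given temperature.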

Specific labeling with the antisense probe occurred at low levels in the
cortex and in the substantia nigra-pars compacta (SN-c). The specificity of labeling
was confirmed by microscopic analysis of emulsion-coated cryosections, as described
above. In contrast, hybridization using the riboprobe specific for the sense strand of
CON193 did not result in specific tissue labeling. The observed regional distribution
of CON193 mRNA suggests that ligands for this GPCR may be involved in signal
transductions important for cellular processes underlying neurological functioning. In
addition, expression of CON193 in the brain provides an indication that modulators of
CON193 activity have utility for treating neurological disorders, including but not
limited to, schizophrenia, depression, anxiety, bipolar disease, epilepsy, neuritis,
neurasthenia, neuropathy, neuroses, and the like. Use of CON193 modulators,
including CON193 ligands and anti-CON193 antibodies, to treat individuals having
such disease states is intended as an aspect of the invention.

B. CON166 

A CON166-specific probe was generated using PCR as described
above for CON193 in Example 3A (but using CON166-specific primers). The probe
consisted of a 259 bp fragment containing sequence at the 3' end of CON166
(nucleotides 715-974 of SEQ ID NO: 1) and containing terminal EcoRI and BamHI
restriction sites. The riboprobes were labeled with [35S]-UTP to yield a specific
activity of 0.40 x 10^ cpm/pmol for antisense riboprobes and 0.65 x 10* cpm/pmol for 
sense-strand riboprobes. Hybridization with the riboprobes and subsequent washing
of the slides was carried out as described above for CON193 in Example 3A.

Specific labeling with the antisense probe occurred in cortical regions,
including the piriform complex, neostriatum, thalamus and hippocampus. The
specificity of labeling was confirmed by microscopic analysis of emulsion-coated
cryosections. These sections revealed that the autoradiographic grains resulting from
antisense riboprobe in situ hybridizations were distributed over cell bodies rather than
trapped between cell bodies. In contrast, hybridization using the riboprobe specific
for the sense strand of CON166 produced a faint signal in the hippocampus only, but
even this signal was found to be non-specific upon microscopic examination. The
observed regional distribution of CON166 mRNA suggests that ligands for this GPCR
may be involved in signal transductions important for cellular processes underlying
neurological functioning. In addition, expression of CON166 in the brain provides an
indication that modulators of CON166 activity have utility for treating neurological
disorders, including but not limited to, schizophrenia, affective disorders,
ADHD/ADD (i.e., Attention Deficit-Hyperactivity Disorder/Attention Deficit
Disorder), and neural disorders such as Alzheimer's disease, Parkinson's disease,
migraine, and senile dementia. Some other diseases for which modulators of
CON166 may have utility include depression, anxiety, bipolar disease, epilepsy,
neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of CON166
modulators, including CON166 ligands and anti-CON166 antibodies, to treat
individuals having such disease states is intended as an aspect of the invention.

C. CON103

A cocktail of two CON103-specific antisense oligonucleotide probes
(CON103a and CON103b) was used because of the relatively high GC content of the
CON103 coding region. The CON103a sequence
(5'-TTTATTAATATTGGAAGGGACAAACTGGAGAGCACAGAACAT-3'; SEQ ID
NO: 72) corresponds to the reverse complement of nucleotides 2196-2237 of SEQ ID
NO: 5 and the CON103b sequence
(5'-AAAGCCACCATGGAAGCCATGCCAAAGATGATGCTGGGCAAGAA-3'; SEQ ID NO: 73)
corresponds to the reverse complement of nucleotides 1495-1538 of SEQ ID NO: 5. Terminal

15 deoxynucleotidyltransferase and [a -"P]dATP were used to 3' end-label CON103a 
( 1 .36 X 1 0' cpm/pmol) and CON 1 03b (9. 1 x 1 0^ cpm/pmol). The probes were 
denatured by incubation at 70°C for three minutes and added to hybridization buffer
containing 50% formamide, 10% dextran, 0.3 M NaCl, 10 mM Tris (pH 8.0), 1 mM
EDTA, 1X Denhardt's Solution, and 200 mM dithiothreitol. The final concentration
of each radiolabeled probe was 2 pmol/ml of hybridization solution. Microscope
slides containing sequential brain cryosections were independently exposed to 45 µl
of hybridization solution (containing the antisense oligonucleotide probes CON103a
and CON103b) per slide and silanized cover slips were placed over the sections being
exposed to hybridization solution. Sections were incubated overnight (15-18 hours) at
37°C to allow hybridization to occur.
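
The amount of each probe applied to a slide follows from the stated probe
concentration (2 pmol/ml) and the volume of hybridization solution per slide
(45 µl). The sketch below works through that arithmetic; the function names are
illustrative, and the specific activity used in the second call is a placeholder
value chosen only to show the calculation, not a figure from this document.

```python
# Illustrative sketch: probe amount, and expected counts, applied per slide.

def probe_per_slide_pmol(concentration_pmol_per_ml, volume_ul):
    """Amount of probe (pmol) in the hybridization solution applied to one slide."""
    return concentration_pmol_per_ml * (volume_ul / 1000.0)

def counts_per_slide(specific_activity_cpm_per_pmol, probe_pmol):
    """Radioactivity (cpm) applied to one slide for a given specific activity."""
    return specific_activity_cpm_per_pmol * probe_pmol

# Values stated in the text: 2 pmol/ml of each probe, 45 microliters per slide.
amount = probe_per_slide_pmol(2.0, 45.0)
print(f"{amount:.2f} pmol of each probe per slide")

# PLACEHOLDER specific activity (hypothetical), used only to show the arithmetic.
print(f"{counts_per_slide(1.0e6, amount):.0f} cpm per slide at 1e6 cpm/pmol")
```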

Following the hybridization period, coverslips were washed off the
slides in 1X SSC. The cryosections were then subjected to three high-stringency
washes in 1X SSC at 65°C for 20 minutes each. Following two room-temperature
washes, cryosections were dehydrated by consecutive exposure to 70%, 95%, and
100% ethanol (0.3 M ammonium acetate added to 70% and 95% ethanol solutions),
followed by air drying and exposure to Kodak BioMax MR-1 film. After 28 days of
exposure, the film was developed. Based on these results, brain sections that showed
positive hybridization signals were coated with Kodak NTB-2 nuclear track emulsion
and the slides were stored in the dark for four months. The slides were then
developed and counterstained with hematoxylin. Emulsion-coated sections were
analyzed microscopically to determine the specificity of labeling. The signal was
determined to be specific if autoradiographic grains (generated by antisense probe
hybridization) were present over cell bodies and not trapped between cell bodies.

Specific labeling with the antisense probe occurred in all cortical
regions, including the piriform cortex and hippocampus. The specificity of labeling
was confirmed by microscopic analysis of emulsion-coated cryosections. These
sections revealed that the autoradiographic grains resulting from antisense riboprobe
in situ hybridizations were distributed over cell bodies rather than trapped between
cell bodies. The observed distribution of CON103 mRNA in the cortical and
paralimbic regions of the mammalian brain suggests that ligands for this GPCR may
be involved in signal transductions important for cellular processes underlying
neurological functioning. In addition, expression of CON103 in the brain provides an
indication that modulators of CON103 activity have utility for treating neurological
and neuropsychiatric disorders, including but not limited to, schizophrenia,
depression, anxiety, attention deficit disorder (with or without hyperactivity), bipolar
disease, epilepsy, migraine, neuritis, neurasthenia, neuropathy, neuroses, obesity,
Parkinson's disease, other dementias, and the like. Use of CON103 modulators,
including CON103 ligands and anti-CON103 antibodies, to treat individuals having
such disease states is intended as an aspect of the invention.

D. CON203

CON203-specific cRNA probes were prepared using conventional
techniques. Initially, a 293 bp fragment of the CON203 coding region, with a BamHI
site and an EcoRI site disposed on opposite ends, was prepared by PCR using primers
LW1314 (5'-GCATGAATTCCCACCTTCATCATCTACCTC-3'; SEQ ID NO: 40)
and LW1315 (5'-GCATGGATCCGAAGACCAAAAAGACCCAG-3'; SEQ ID NO:
41). LW1314 includes an EcoRI site and additional protective residues at its 5'
terminus, with the rest of the sequence corresponding to CON203 coding nucleotides
164-183, which correspond to positions 309-328 of SEQ ID NO: 7. LW1315 includes
5' protective nucleotides and a BamHI site, with the rest of the sequence
corresponding to the complement of CON203 coding nucleotides 438-456, which
correspond to positions 583-601 of SEQ ID NO: 7. The PCR-amplified fragment was
then digested with BamHI and EcoRI and ligated into the corresponding sites of
pBluescript II to yield pCon203 BS. The recombinant clone was then linearized either
with BamHI or EcoRI. Linearization with BamHI provided a substrate for in vitro
expression of a sense-strand cRNA probe using the vector-borne T7 promoter.
Digestion with EcoRI was used to provide a substrate for in vitro transcription using
the vector-borne T3 promoter to generate an anti-sense cRNA probe. In vitro
transcriptions were performed in the presence of [35S]-UTP, thereby yielding sense-
and anti-sense strand riboprobes having specific radioactivities of 5.38 x 10^ 
cpm/pmol and 5.34 x 10^ cpm/pmol, respectively. Hybridization with the riboprobes 

and subsequent washing of the slides was carried out as described above for CON193
in Example 3A. Subsequently, the slides were exposed to Kodak BioMax MR-1 film.
After 9 days of exposure, the film was developed. Based on these results, brain
sections that gave rise to positive hybridization signals were coated with Kodak
NTB-2 nuclear track emulsion and the slides were stored in the dark for 25 days. The
slides were then developed as described above for CON193 in Example 3A.
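
The primer descriptions above give each annealing region in two coordinate systems:
CON203 coding nucleotides and positions in SEQ ID NO: 7. As an illustrative sketch
with hypothetical function names, the constant offset between the two systems can be
derived from one stated pair and used to check the others, along with the 293 bp
fragment length:

```python
# Illustrative sketch: convert CON203 coding-region coordinates to positions in
# SEQ ID NO: 7 using the offset implied by the text, and check the probe length.

def offset_from_pair(coding_position, sequence_position):
    """Offset such that sequence_position = coding_position + offset."""
    return sequence_position - coding_position

def to_sequence_position(coding_position, offset):
    """Map a coding-region coordinate onto the full sequence."""
    return coding_position + offset

def inclusive_length(first, last):
    """Number of nucleotides from first to last, counting both ends."""
    return abs(last - first) + 1

# Coding nucleotide 164 is stated to correspond to position 309 of SEQ ID NO: 7.
offset = offset_from_pair(164, 309)
print("offset:", offset)

# The remaining stated coordinates should map with the same offset.
for coding in (183, 438, 456):
    print(coding, "->", to_sequence_position(coding, offset))

# Span between the outermost annealing positions (309 and 601).
print("fragment length:", inclusive_length(309, 601), "bp")
```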

Specific labeling with the antisense probe occurred in several limbic
and paralimbic regions, as well as areas thought to be involved in voluntary motor
control. In particular, the probe hybridized to CON203 mRNAs in the following
regions of the brain: cortical regions, including the piriform cortex, neostriatum,
lateral olfactory tract, hypothalamic nuclei, bed nucleus of the stria terminalis,
amygdala, hippocampus, reticular thalamus and other thalamic regions, subthalamic
nucleus, and the red nucleus. The specificity of labeling was confirmed by
microscopic analysis of emulsion-coated cryosections. These sections revealed that
the autoradiographic grains resulting from antisense riboprobe in situ hybridizations
were distributed over cell bodies rather than trapped between cell bodies. Confirming
expression of CON203 mRNA, the sense-strand riboprobe did not show specific
hybridization. The observed distribution of CON203 mRNA in the cortical
(particularly, motor circuits) and paralimbic regions of the mammalian brain suggests
that CON203 and the ligands for this GPCR may be involved in signal transductions
important for cellular processes underlying neurological functioning. In addition,
expression of CON203 in the brain provides an indication that modulators of
CON203 activity have utility for treating neurological disorders, including but not
limited to, schizophrenia, depression, anxiety, bipolar disease, epilepsy, migraine,
attention deficit disorder (with or without hyperactivity), neuritis, neurasthenia,
neuropathy, neuroses, Parkinson's disease, dementia, obesity, and the like. Use of
CON203 modulators, including CON203 ligands and anti-CON203 antibodies, to
treat individuals having such disease states is intended as an aspect of the invention.

E. CON198 

A 266 bp fragment of CON198 containing EcoRI and BamHI
restriction sites was amplified from the full-length clone by PCR, using the primers
LW1308: 5'-GCATGAATTCACTCACTTCTCATCTCCTTC-3' (SEQ ID NO: 46)
and LW1309: 5'-GCATGGATCCAATCTCCTTTGTCTTCACTC-3' (SEQ ID NO:
47). Primer LW1308 contains an EcoRI site (underlined) followed by sequence
identical to nucleotides 638-657 of SEQ ID NO: 9. Primer LW1309 contains a BamHI
site (underlined) followed by sequence complementary to nucleotides 903-884 of SEQ
ID NO: 9. The amplification product was digested with EcoRI and BamHI, and then
subcloned into an EcoRI- and BamHI-digested pBluescript II vector (Stratagene). The
266 amplified and subcloned basepairs correspond to nucleotides 638 to 903 of SEQ
ID NO: 9.
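
Each primer described above has three parts: a short 5' protective tail, a restriction
site, and a gene-specific annealing region. The sketch below, offered only as an
illustration, splits the two CON198 primers at their restriction sites and checks the
annealing-region lengths against the stated coordinates. It assumes the primer
sequences read as shown (with the EcoRI site as GAATTC and the BamHI site as GGATCC);
the function names are hypothetical.

```python
# Illustrative sketch: split a primer into 5' tail, restriction site, and
# gene-specific region, then compare lengths with the stated coordinates.
# Recognition sequences: EcoRI = GAATTC, BamHI = GGATCC.

RESTRICTION_SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC"}

def split_primer(primer, site):
    """Return (5' tail, site, gene-specific region) for the first site occurrence."""
    index = primer.find(site)
    if index < 0:
        raise ValueError(f"site {site} not found in primer")
    return primer[:index], site, primer[index + len(site):]

def inclusive_length(first, last):
    """Number of nucleotides from first to last, counting both ends."""
    return abs(last - first) + 1

# Primer sequences as given for CON198 (assumed reading of the printed text).
LW1308 = "GCATGAATTCACTCACTTCTCATCTCCTTC"
LW1309 = "GCATGGATCCAATCTCCTTTGTCTTCACTC"

tail_f, _, anneal_f = split_primer(LW1308, RESTRICTION_SITES["EcoRI"])
tail_r, _, anneal_r = split_primer(LW1309, RESTRICTION_SITES["BamHI"])

print("LW1308:", tail_f, "+ EcoRI +", anneal_f, f"({len(anneal_f)} nt)")
print("LW1309:", tail_r, "+ BamHI +", anneal_r, f"({len(anneal_r)} nt)")
print("amplified region:", inclusive_length(638, 903), "bp")
```

Counting both end positions, nucleotides 638 to 903 span 266 positions, which matches
the stated fragment size when the primer tails and restriction sites are excluded.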

The subcloned CON198-Bluescript construct was used to generate
strand-specific probes for the in situ hybridization experiments. The construct was
linearized with BamHI, for labeling with T7 polymerase (sense), or EcoRI, for T3
polymerase (antisense), and used as a template for in vitro transcription of sense and
antisense cRNA riboprobes. The riboprobes were labeled with 35S-UTP to yield a
specific activity of 0.45 x 10^ cpni/pmoi for antisense and 0.732 x 10*^ cpm/pmol for 
sense probe. Hybridization with the riboprobes and subsequent washing of the
slides was carried out as described above for CON193 in Example 3A.

Specific labeling with the antisense probe showed distribution of
CON198 mRNA in the rat brain in several limbic and paralimbic regions as well as
areas thought to be involved in voluntary motor control. Labelled regions included
cortical regions, piriform cortex, hypothalamic nuclei (paraventricular nucleus,
supraoptic nucleus, suprachiasmatic nucleus), hippocampus, reticular thalamus,
substantia nigra-pars compacta (SN-C), ventral tegmental area, and the red nucleus.
The specificity of labeling was confirmed by microscopic analysis of emulsion coated
sections. These sections revealed that the autoradiographic grains generated by the
antisense probe were distributed over cell bodies rather than trapped between cell
bodies. Sense probe did not generate specific labeling.

The observed regional distribution of CON198 mRNA provides a
therapeutic indication for natural ligands for CON198 as well as modulators of
CON198 activity, such as anti-CON198 antibody substances or small molecules that
agonize or antagonize ligand-mediated CON198 signalling. In particular, the
expression pattern provides an indication that such molecules will have utility for
treating neurological and/or psychiatric diseases, including but not limited to
schizophrenia, depression, anxiety, bipolar disease, affective disorders, ADHD/ADD,
epilepsy, neuritis, neurasthenia, neuropathy, neuroses, Alzheimer's disease,
Parkinson's disease, migraine, senile dementia, and the like. Use of CON198
modulators, including CON198 ligands and anti-CON198 antibodies, to treat
individuals having such disease states is intended as an aspect of the invention. Such
modulators are administered by any means effective to safely deliver the modulators
to the CON198-expressing cells, including but not limited to oral administration,
inhalation, or injection of compositions comprising the modulators in a
pharmaceutically acceptable diluent, adjuvant, or carrier. Efficacy of treatment can
initially be determined in any accepted animal model that provides a biochemical or
behavioral marker that correlates with disease severity or treatment efficacy.



F. CON197

A 261 bp fragment of CON197 containing EcoRI and BamHI
restriction sites was amplified from the full-length clone by PCR, using the primers
LW1306: 5'-GCATGAATTCTTCTACTTCATCATCCTCC-3' (SEQ ID NO: 50) and
LW1307: 5'-GCATGGATCCAAAGGCCATCACAACAAG-3' (SEQ ID NO: 51).
Primer LW1306 includes sequence identical to nucleotides 100-118 of SEQ ID NO:
11 (underlined), preceded by an EcoRI site. Primer LW1307 includes sequence
complementary to nucleotides 361-343 of SEQ ID NO: 11 (underlined), preceded by a
BamHI restriction site. The amplification product was digested with EcoRI and
BamHI, and then subcloned into an EcoRI- and BamHI-digested pBluescript II vector
(Stratagene). The 261 amplified and subcloned basepairs correspond to nucleotides
100 to 361 of SEQ ID NO: 11.

The subcloned CON197-Bluescript construct was used to generate
strand-specific probes for the in situ hybridization experiments. The construct was
linearized with BamHI, for labeling with T7 polymerase (sense), or EcoRI, for T3
polymerase (antisense), and used as a template for in vitro transcription of sense and
antisense cRNA riboprobes. The riboprobes were labeled with 35S-UTP to yield a
specific activity of 0.51 x 10^ cpm/pmol for antisense and 0.432 x 10^ cpm/pmol for 
sense probe. Hybridization with the riboprobes and subsequent washing of the slides
was carried out as described above for CON193 in Example 3A.

Specific labeling with the antisense probe showed widespread
distribution of CON197 mRNA in the rat brain. Labelled regions included neo- and
allocortex, piriform cortex, neostriatum, thalamic nuclei, hypothalamic nuclei,
hippocampus, amygdala, cerebellum, and the olfactory bulb. The specificity of
labeling was confirmed by microscopic analysis of emulsion coated sections. These
sections revealed that the autoradiographic grains generated by the antisense probe
were distributed over cell bodies rather than trapped between cell bodies. Sense probe
did not generate specific labeling.

The observed regional distribution of CON197 mRNA provides a
therapeutic indication for natural ligands for CON197 as well as modulators of
CON197 activity, such as anti-CON197 antibody substances or small molecules that
agonize or antagonize ligand-mediated CON197 signalling. In particular, the
expression pattern provides an indication that such molecules will have utility for
treating neurological and/or psychiatric diseases, including but not limited to
dementia, schizophrenia, depression, anxiety, bipolar disease, migraine, Parkinson's
disease, affective disorders, Alzheimer's disease, senile dementia, attention deficit
hyperactivity disorder/attention deficit disorder (ADHD/ADD), epilepsy, neuritis,
neurasthenia, neuropathy, neuroses, and the like. Use of CON197 modulators,
including CON197 ligands and anti-CON197 antibodies, to treat individuals having
such disease states is intended as an aspect of the invention. Such modulators are
administered by any means effective to safely deliver the modulators to the
CON197-expressing cells, including but not limited to oral administration, inhalation, or
injection of compositions comprising the modulators in a pharmaceutically acceptable
diluent, adjuvant, or carrier. Efficacy of treatment can initially be determined in any
accepted animal model that provides a biochemical or behavioral marker that
correlates with disease severity or treatment efficacy.

G. CON202 

A 272 bp fragment of CON202 containing EcoRI and BamHI
restriction sites was amplified from the full-length clone by PCR, using the primers
LW1310 GCATGAATTCGCAGAAGAAGGCTATTGG (SEQ ID NO: 56) and
LW1311 GCATGGATCCGCAGTAAAGAAGGGTTGTG (SEQ ID NO: 57). The
amplification product was digested with EcoRI and BamHI, and then subcloned into a
pBluescript II vector (Stratagene) that was digested with EcoRI and BamHI. The 272
amplified and subcloned basepairs correspond to nucleotides 1065 to 1336 of SEQ ID
NO: 13.

The subcloned CON202-Bluescript construct was used to generate
strand-specific probes for the in situ hybridization experiments. The construct was
linearized with BamHI, for labeling with T7 polymerase (sense), or EcoRI, for T3
polymerase (antisense), and used as a template for in vitro transcription of sense and
antisense cRNA riboprobes. The riboprobes were labeled with 35S-UTP to yield a
specific activity of 4.7 x 10^ cpm/pmol for antisense and 4.3 x 10' cpm/pmol for sense
probe. Hybridization with the riboprobes and subsequent washing of the slides was
carried out as described above for CON193 in Example 3A.

Specific labeling with the antisense probe showed widespread
distribution of CON202 mRNA in the rat brain. Labelled regions included the
cortical regions, lateral olfactory nuclei, hippocampus, subthalamic nucleus, and, at a
lower level, the nigra-pars compacta.

The observed regional distribution of CON202 mRNA provides a
therapeutic indication for natural ligands for CON202 as well as modulators of
CON202 activity, such as anti-CON202 antibody substances or small molecules that
agonize or antagonize ligand-mediated CON202 signaling. In particular, the
expression pattern provides an indication that such molecules will have utility for
treating neurological and/or psychiatric diseases, including but not limited to
schizophrenia, affective disorders, attention deficit hyperactivity disorder/attention
deficit disorder, depression, anxiety, bipolar disease, epilepsy, neuritis, neurasthenia,
neuropathy, neuroses, Alzheimer's disease, Parkinson's disease, migraine, senile
dementia, and the like. Use of CON202 modulators, including CON202 ligands and
anti-CON202 antibodies, to treat individuals having such disease states is intended as
an aspect of the invention. Such modulators are administered by any means effective
to safely deliver the modulators to the CON202-expressing cells, including but not
limited to oral administration, inhalation, or injection of compositions comprising the
modulators in a pharmaceutically acceptable diluent, adjuvant, or carrier. Efficacy of
treatment can initially be determined in any accepted animal model that provides a
biochemical or behavioral marker that correlates with disease severity or treatment
efficacy.

H. CON222 

A 264 bp fragment of CON222 containing EcoRI and BamHI
restriction sites was amplified from the full-length clone by PCR, using the primers
LW1472 (5'GCATGAATTCTGCCATGTCAATCATTTCTCTC3'; SEQ ID NO: 62,
EcoRI site is underlined) and LW1473
(5'GCATGGATCCGTTCTGCATTTTCCAGGTCTC3'; SEQ ID NO: 63, BamHI site is
underlined). The amplification product was digested with EcoRI and BamHI, and
then subcloned into a predigested pBluescript II vector (Stratagene). The 264
amplified and subcloned basepairs correspond to nucleotides 237 to 500 of SEQ ID
NO: 15.

The subcloned CON222-Bluescript construct was used to generate
strand-specific probes for the in situ hybridization experiments. The construct was
linearized with BamHI, for labeling with T7 polymerase (sense), or EcoRI, for T3
polymerase (antisense), and used as a template for in vitro transcription of sense and
antisense cRNA riboprobes. The riboprobes were labeled with 35S-UTP to yield a
specific activity of 4.25 x 10^ cpm/pmol for antisense and 3.9 x 10^ cpm/pmol for
sense probe. Hybridization with the riboprobes and subsequent washing of the slides
was carried out as described above for CON193 in Example 3A.

Specific labeling with the antisense probe showed widespread
distribution of CON222 mRNA in the rat brain. Labelled regions included the
cortical regions, piriform cortex, striatum, hippocampus, thalamus, hypothalamus,
dorsal raphe, and habenula.

The observed regional distribution of CON222 mRNA provides a
therapeutic indication for natural ligands for CON222 as well as modulators of
CON222 activity, such as anti-CON222 antibody substances or small molecules that
agonize or antagonize ligand-mediated CON222 signaling. In particular, the
expression pattern provides an indication that such molecules will have utility for
treating neurological and/or psychiatric diseases, including but not limited to
schizophrenia, affective disorders, attention deficit hyperactivity disorder/attention
deficit disorder, depression, anxiety, bipolar disease, epilepsy, neuritis, neurasthenia,
neuropathy, neuroses, Alzheimer's disease, Parkinson's disease, migraine, senile
dementia, and the like. Use of CON222 modulators, including CON222 ligands and
anti-CON222 antibodies, to treat individuals having such disease states is intended as
an aspect of the invention. Such modulators are administered by any means effective
to safely deliver the modulators to the CON222-expressing cells, including but not
limited to oral administration, inhalation, or injection of compositions comprising the
modulators in a pharmaceutically acceptable diluent, adjuvant, or carrier. Efficacy of
treatment can initially be determined in any accepted animal model that provides a
biochemical or behavioral marker that correlates with disease severity or treatment
efficacy.

I. CON215

A 261 bp fragment of CON215 containing EcoRI and BamHI
restriction sites was amplified from the full-length clone by PCR, using the primers
LW1411: 5'-GCATGAATTCTGCCAAACATCATCCTGAC-3' (SEQ ID NO: 64)
and LW1412: 5'-GCATGGATCCTACACAGCCACAACAACCC-3' (SEQ ID NO:
65). Primer LW1411 contains an EcoRI site (underlined) followed by sequence
identical to CON215 coding nucleotides 521-537, which correspond to positions
533-549 of SEQ ID NO: 17. Primer LW1412 contains a BamHI site (underlined) followed
by sequence complementary to CON215 coding nucleotides 764-781, which
correspond to positions 776-793 of SEQ ID NO: 17. The amplification product was
digested with EcoRI and BamHI, and then subcloned into an EcoRI- and
BamHI-digested pBluescript II vector (Stratagene). The 261 amplified and subcloned
basepairs correspond to nucleotides 521 to 781 of SEQ ID NO: 17.
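
The coordinate bookkeeping in the preceding paragraph can be checked with a short sketch. It assumes a constant offset of 12 between CON215 coding-sequence numbering and SEQ ID NO: 17 numbering, which is inferred from the position pairs quoted above (521 and 533, 781 and 793); the helper names are hypothetical.

```python
# Offset between coding-sequence numbering and SEQ ID NO: 17 numbering.
# Assumption: inferred from the pairs quoted in the text (521 -> 533, 781 -> 793).
CODING_TO_SEQ_OFFSET = 12


def coding_to_seq(position, offset=CODING_TO_SEQ_OFFSET):
    """Map a coding-sequence position to the corresponding SEQ ID position."""
    return position + offset


def inclusive_length(start, end):
    """Number of nucleotides in a span whose endpoints are both included."""
    return abs(end - start) + 1


if __name__ == "__main__":
    # Forward primer target: coding nucleotides 521-537.
    print(coding_to_seq(521), coding_to_seq(537))
    # Reverse primer target: coding nucleotides 764-781.
    print(coding_to_seq(764), coding_to_seq(781))
    # Amplified insert spans coding nucleotides 521 to 781, inclusive.
    print(inclusive_length(521, 781))
```

Counting both endpoints, the span from coding nucleotide 521 to 781 contains 261 nucleotides, in agreement with the stated fragment size.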

The subcloned CON215-Bluescript construct was used to generate
strand-specific probes for the in situ hybridization experiments. The construct was
linearized with BamHI, for labeling with T7 polymerase (sense), or EcoRI, for T3
polymerase (antisense), and used as a template for in vitro transcription of sense and
antisense cRNA riboprobes. The riboprobes were labeled with 35S-UTP to yield a
specific activity of 48.03 x 10^ cpm/pmol for antisense and 48.09 x 10'' cpm/pmol for
sense probe. Hybridization with the riboprobes and subsequent washing of the slides
was carried out as described above for CON193 in Example 3A.

Subsequently, the slides were exposed to Kodak BioMax MR-1 film.
After 9 days of exposure, the film was developed. Slides containing sections that
showed a hybridization signal on film autoradiograms were coated with Kodak
NTB-2 nuclear track emulsion and stored in the dark for 25 days. The slides were then
developed as described above for CON193 in Example 3A.

Specific labeling with the antisense probe showed distribution of
CON215 mRNA in the rat brain in limbic, endocrine, and motor circuits. Specifically,
CON215 mRNA was present in the cortex, hippocampus, and red nucleus. The
specificity of labeling was confirmed by microscopic analysis of emulsion-coated
sections. These sections revealed that the autoradiographic grains generated by the
antisense probe were distributed over cell bodies rather than trapped between cell
bodies. Sense probe did not generate specific labeling.

The observed regional distribution of CON215 mRNA provides a
therapeutic indication for natural ligands for CON215 as well as modulators of
CON215 activity, such as anti-CON215 antibody substances or small molecules that
agonize or antagonize ligand-mediated CON215 signaling. In particular, the
expression pattern provides an indication that such molecules will have utility for
treating neurological and/or psychiatric diseases, including but not limited to
schizophrenia, depression, anxiety, bipolar disease, epilepsy, migraine, attention
deficit (with or without hyperactive disorder), neuritis, neurasthenia, neuropathy,
neuroses, Parkinson's disease, dementia, obesity, and the like. Use of CON215
modulators, including CON215 ligands and anti-CON215 antibodies, to treat
individuals having such disease states is intended as an aspect of the invention.

Such modulators are administered by any means effective to safely
deliver the modulators to the CON215-expressing cells, including but not limited to
oral administration, inhalation, or injection of compositions comprising the
modulators in a pharmaceutically acceptable diluent, adjuvant, or carrier. Efficacy of
treatment can initially be determined in any accepted animal model that provides a
biochemical or behavioral marker that correlates with disease severity or treatment
efficacy.

J. CON217

Two oligonucleotides were designed based on SEQ ID NO: 19 and
obtained from Sigma-Genosys (St. Louis, MO) to use as probes for in situ
hybridization. The first oligonucleotide, designated 217A, has the sequence
5'TAGGTCGGTAGTCAGGACACGGGAGAACAGAACTGTTGGTTGA3' (SEQ
ID NO: 68), which is complementary to nucleotides 102 to 60 of SEQ ID NO: 19. The
second oligonucleotide, designated 217B, has the sequence
5'GCCCCTGTGGCGGTTTAGATCGAGAATGCCCATTTTCTGTTCCATCTAACCA3'
(SEQ ID NO: 69), which corresponds to the complement of nucleotides 1530 to
1479 of SEQ ID NO: 17. Both oligonucleotides, 217A and 217B, were reconstituted
with 1x TE buffer to a concentration of 20 pMol/ml and labeled with ^^P-dATP to
yield a specific activity of 2.08 x 10^ and 1.53 x 10^ cpm/ml, respectively.

Hybridization was carried out at 37°C overnight as described above for
CON193 in Example 3A. Following the hybridizations, the coverslips were washed
off the slides with 1x SSC for 45 minutes. The slides were then washed for 20
minutes at room temperature in 1x SSC followed by three high stringency washes in
1x SSC at 65°C. After washing, the slides were dehydrated with 70%, 95%, and
100% ethanol containing 0.3 mM NH4OAc, air-dried, and exposed to Kodak BioMax
MR-1 film. After 21 days of exposure, the film was developed. Based on these
results, sections that showed a hybridization signal on film autoradiography were
coated with Kodak NTB-2 nuclear track emulsion and stored in the dark for 42 days.
The slides were then developed and counterstained with hematoxylin.
Emulsion-coated sections were analyzed microscopically to determine the specificity of labeling.
The signal was judged to be specific if autoradiographic grains (generated by
antisense probe hybridization) were associated clearly with crystal violet stained cell
bodies. Autoradiographic grains found between cell bodies were deemed
non-specific.

Specific labeling with the antisense probe showed widespread
distribution of CON217 mRNA in the rat brain. Labelled regions included the cortex,
piriform cortex, hippocampus, cerebellum, medulla, spinal cord, temporal lobe,
putamen, substantia nigra, and thalamus.

The observed regional distribution of CON217 mRNA provides a
therapeutic indication for natural ligands for these G protein-coupled receptors as well
as modulators of their activity, such as anti-CON217 antibody substances or small
molecules that mimic, agonize or antagonize ligand-mediated CON217 signaling. In
particular, the expression patterns provide an indication that such molecules will have
utility for treating neurological and/or psychiatric diseases, including but not limited
to schizophrenia, affective disorders, attention deficit hyperactivity disorder/attention
deficit disorder, depression, anxiety, bipolar disease, epilepsy, neuritis, neurasthenia,
neuropathy, neuroses, Alzheimer's disease, Parkinson's disease, migraine, senile
dementia, and the like. Use of CON217 polypeptide modulators, including CON217
ligands and anti-CON217 polypeptide antibodies, to treat individuals having such
disease states is intended as an aspect of the invention. Such modulators are
administered by any means effective to safely deliver the modulators to the GPCR
polypeptide-expressing cells, including but not limited to oral administration,
inhalation, or injection of compositions comprising the modulators in a
pharmaceutically acceptable diluent, adjuvant, or carrier. Efficacy of treatment can
initially be determined in any accepted animal model that provides a biochemical or
behavioral marker that correlates with disease severity or treatment efficacy.



EXAMPLE 4 

Recombinant Expression of GPCR Polypeptides in Eukaryotic Host Cells

To produce GPCR protein, a GPCR polypeptide-encoding
polynucleotide is expressed in a suitable host cell using a suitable expression vector,
using standard genetic engineering techniques. For example, one of the GPCR
polypeptide-encoding sequences described in Example 1 (such as SEQ ID NOS: 1, 3,
5, 7, 9, 11, 13, 15, 17 or 19) is subcloned into the commercial expression vector
pzeoSV2 (Invitrogen, San Diego, CA) and transfected into Chinese Hamster Ovary
(CHO) cells (ATCC CRL-1781) using the transfection reagent FuGENE 6
(Boehringer-Mannheim) and the transfection protocol provided in the product insert.
Additional eukaryotic cell lines, such as African Green Monkey Kidney cells
(COS-7, ATCC CRL-1651) or Human Kidney cells (HEK 293, ATCC CRL-1573), may be
used as well. Cells stably expressing a GPCR polypeptide (e.g., CON193, CON166,
CON103, CON203, CON198, CON197, CON202, CON222, CON215, or CON217)
are selected by growth in the presence of 100 mg/ml zeocin (Stratagene, La Jolla,
CA). Optionally, GPCR polypeptide is purified from the cells using standard
chromatographic techniques. To facilitate purification, antisera is raised against one
or more synthetic peptide sequences that correspond to portions of the GPCR amino
acid sequence, and the antisera is used to affinity purify GPCR polypeptides. The
GPCR gene also may be expressed in frame with a tag sequence (e.g., polyhistidine,
hemagglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that
many of the uses for GPCR polypeptides, such as assays described below, do not
require purification of GPCR polypeptides from the host cell.

EXAMPLE 5 
Antibodies to GPCR Polypeptides 

Standard techniques are employed to generate polyclonal or
monoclonal antibodies to the GPCR receptors (e.g., CON193, CON166, CON103,
CON203, CON198, CON197, CON202, CON222, CON215, or CON217), and to
generate useful antigen-binding fragments thereof or variants thereof, including
"humanized" variants. Such protocols can be found, for example, in Sambrook et al.,
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor,
New York: Cold Spring Harbor Laboratory (1989); Harlow et al. (Eds), Antibodies: A
Laboratory Manual, Cold Spring Harbor Laboratory; Cold Spring Harbor, NY
(1988); and other documents cited below. In one embodiment, recombinant GPCR
polypeptides (or cells or cell membranes containing such polypeptides) of the
invention are used as an antigen to generate the antibodies. In another embodiment,
one or more peptides having amino acid sequences corresponding to an immunogenic
portion of a GPCR polypeptide (e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,
20, or more amino acids) are used as antigen. Peptides corresponding to extracellular
portions of GPCR polypeptides, especially hydrophilic extracellular portions, are
preferred. The antigen may be mixed with an adjuvant or linked to a hapten to
increase antibody production.

A. Polyclonal or Monoclonal antibodies 

As one exemplary protocol, a recombinant GPCR polypeptide or
synthetic fragment thereof is used to immunize a mouse for generation of monoclonal
antibodies (or larger mammal, such as a rabbit, for polyclonal antibodies). To
increase antigenicity, peptides are conjugated to Keyhole Limpet Hemocyanin
(Pierce), according to the manufacturer's recommendations. For an initial injection,
the antigen is emulsified with Freund's Complete Adjuvant and injected
subcutaneously. At intervals of two to three weeks, additional aliquots of GPCR
antigen are emulsified with Freund's Incomplete Adjuvant and injected
subcutaneously. Prior to the final booster injection, a serum sample is taken from the
immunized mice and assayed by Western blot to confirm the presence of antibodies
that immunoreact with GPCR polypeptide. Serum from the immunized animals may
be used as a polyclonal antisera or used to isolate polyclonal antibodies that recognize
GPCR polypeptide. Alternatively, the mice are sacrificed and their spleens removed
for generation of monoclonal antibodies.

To generate monoclonal antibodies, the spleens are placed in 10 ml
serum-free RPMI 1640, and single cell suspensions are formed by grinding the
spleens in serum-free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM
sodium pyruvate, 100 units/ml penicillin, and 100 µg/ml streptomycin (RPMI)
(Gibco, Canada). The cell suspensions are filtered and washed by centrifugation and
resuspended in serum-free RPMI. Thymocytes taken from three naive Balb/c mice
are prepared in a similar manner and used as a feeder layer. NS-1 myeloma cells,
kept in log phase in RPMI with 10% fetal bovine serum (FBS) (Hyclone Laboratories,
Inc., Logan, Utah) for three days prior to fusion, are centrifuged and washed as well.

To produce hybridoma fusions, spleen cells from the immunized mice
are combined with NS-1 cells and centrifuged, and the supernatant is aspirated. The
cell pellet is dislodged by tapping the tube, and 2 ml of 37°C PEG 1500 (50% in
75 mM Hepes, pH 8.0) (Boehringer Mannheim) is stirred into the pellet, followed by
the addition of serum-free RPMI. Thereafter, the cells are centrifuged and
resuspended in RPMI containing 15% FBS, 100 µM sodium hypoxanthine, 0.4 µM
aminopterin, 16 µM thymidine (HAT) (Gibco), 25 units/ml of IL-6 (Boehringer
Mannheim) and 1.5x10^ thymocytes/ml and plated into 10 Corning flat-bottom
96-well tissue culture plates (Corning, Corning, New York).

On days 2, 4, and 6 after the fusion, 100 µl of medium is removed
from the wells of the fusion plates and replaced with fresh medium. On day 8, the
fusions are screened by ELISA, testing for the presence of mouse IgG that binds to a
GPCR polypeptide. Selected fusion wells are further cloned by dilution until
monoclonal cultures producing anti-GPCR polypeptide antibodies are obtained.

B. Humanization of Anti-GPCR Monoclonal Antibodies 

The expression patterns of GPCR polypeptides as reported herein and
the proven track record of GPCRs as targets for therapeutic intervention suggest
therapeutic indications for GPCR polypeptide inhibitors (antagonists). GPCR
polypeptide-neutralizing antibodies comprise one class of therapeutics useful as
antagonists. Following are protocols to improve the utility of anti-GPCR polypeptide
monoclonal antibodies as therapeutics in humans, by "humanizing" the monoclonal
antibodies to improve their serum half-life and render them less immunogenic in
human hosts (i.e., to prevent human antibody response to non-human anti-GPCR
polypeptide antibodies).

The principles of humanization have been described in the literature
and are facilitated by the modular arrangement of antibody proteins. To minimize the
possibility of binding complement, a humanized antibody of the IgG4 isotype is
preferred.

For example, a level of humanization is achieved by generating
chimeric antibodies comprising the variable domains of non-human antibody proteins
of interest with the constant domains of human antibody molecules. (See, e.g.,
Morrison and Oi, Adv. Immunol., 44:65-92 (1989).) The variable domains of
GPCR-neutralizing anti-GPCR antibodies are cloned from the genomic DNA of a B-cell
hybridoma or from cDNA generated from mRNA isolated from the hybridoma of
interest. The V region gene fragments are linked to exons encoding human antibody
constant domains, and the resultant construct is expressed in suitable mammalian host
cells (e.g., myeloma or CHO cells).

To achieve an even greater level of humanization, only those portions
of the variable region gene fragments that encode antigen-binding complementarity
determining regions ("CDR") of the non-human monoclonal antibody genes are
cloned into human antibody sequences. [See, e.g., Jones et al., Nature, 321:522-525
(1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science,
239:1534-36 (1988); and Tempest et al., Bio/Technology, 9:266-71 (1991).] If
necessary, the β-sheet framework of the human antibody surrounding the CDR3
regions also is modified to more closely mirror the three dimensional structure of the
antigen-binding domain of the original monoclonal antibody. (See Kettleborough
et al., Protein Engin., 4:773-783 (1991); and Foote et al., J. Mol. Biol., 224:487-499
(1992).)

In an alternative approach, the surface of a non-human monoclonal
antibody of interest is humanized by altering selected surface residues of the
non-human antibody, e.g., by site-directed mutagenesis, while retaining all of the
interior and contacting residues of the non-human antibody. See Padlan, Molecular
Immunol., 28(4/5):489-98 (1991).

The foregoing approaches are employed using GPCR-neutralizing
anti-GPCR monoclonal antibodies and the hybridomas that produce them to generate
humanized GPCR-neutralizing antibodies useful as therapeutics to treat or palliate
conditions wherein GPCR expression or ligand-mediated GPCR signaling is
detrimental.

C. Human GPCR-Neutralizing Antibodies from Phage Display

Human GPCR-neutralizing antibodies are generated by phage display
techniques such as those described in Aujame et al., Human Antibodies, 8(4):155-168
(1997); Hoogenboom, TIBTECH, 15:62-70 (1997); and Rader et al., Curr. Opin.
Biotechnol., 8:503-508 (1997), all of which are incorporated by reference. For
example, antibody variable regions in the form of Fab fragments or linked single
chain Fv fragments are fused to the amino terminus of filamentous phage minor coat
protein pIII. Expression of the fusion protein and incorporation thereof into the
mature phage coat results in phage particles that present an antibody on their surface
and contain the genetic material encoding the antibody. A phage library comprising
such constructs is expressed in bacteria, and the library is panned (screened) for
GPCR-specific phage-antibodies using labelled or immobilized GPCR polypeptide as
antigen-probe.

D. Human GPCR-Neutralizing Antibodies from Transgenic Mice

Human GPCR-neutralizing antibodies are generated in transgenic mice
essentially as described in Bruggemann and Neuberger, Immunol. Today,
17(8):391-97 (1996) and Bruggemann and Taussig, Curr. Opin. Biotechnol., 8:455-58
(1997). Transgenic mice carrying human V-gene segments in germline configuration
and that express these transgenes in their lymphoid tissue are immunized with a
GPCR composition using conventional immunization protocols. Hybridomas are
generated using B cells from the immunized mice using conventional protocols and
screened to identify hybridomas secreting anti-GPCR human antibodies (e.g., as
described above).

EXAMPLE 6 

Assays to Identify Modulators of GPCR Polypeptide Activity 

Set forth below are assays for identifying modulators (agonists and
antagonists) of GPCR polypeptide activity. The modulators that can be
identified by these assays include natural ligand compounds of the receptor; synthetic
analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or
antibody-like compounds derived from natural antibodies or from antibody-like
combinatorial libraries; and/or synthetic compounds identified through high
throughput screening of libraries; and the like. All modulators that bind GPCR
polypeptide are useful for identifying GPCR polypeptide in tissue samples (e.g., for
diagnostic purposes, pathological purposes, and the like). Agonist and antagonist
modulators are useful for up-regulating and down-regulating GPCR polypeptide
activity, respectively, to treat disease states characterized by abnormal levels of GPCR
polypeptide activity. GPCR polypeptide binding molecules also may be used to
deliver a therapeutic compound or a label to cells that express GPCR polypeptide
(e.g., by attaching the compound or label to the binding molecule). The assays may
be performed using single putative modulators, and/or may be performed using a
known agonist in combination with candidate antagonists (or vice versa).
Performance of the assays using any of the GPCR polypeptides of the invention
described herein (e.g., CON193, CON166, CON103, CON203, CON198, CON197,
CON202, CON222, CON215, or CON217) is contemplated. It will be appreciated
that co-transfecting cells with two or more of the receptors for simultaneous screening
also is possible.



A. cAMP Assays

In one type of assay, levels of cyclic adenosine monophosphate
(cAMP) are measured in GPCR-transfected cells that have been exposed to candidate
modulator compounds. Protocols for cAMP assays have been described in the
literature. [See, e.g., Sutherland et al., Circulation, 37: 279 (1968); Frandsen, E.K.
and Krishna, G., Life Sciences, 18: 529-541 (1976); Dooley et al., Journal of
Pharmacology and Experimental Therapeutics, 283 (2): 735-41 (1997); and George et
al., Journal of Biomolecular Screening, 2 (4): 235-40 (1997).] An exemplary
protocol for such an assay, using an Adenylyl Cyclase Activation FlashPlate® Assay
from NEN™ Life Science Products, is set forth below.

Briefly, the GPCR coding sequence (e.g., a cDNA or intronless
genomic DNA) is subcloned into a commercial expression vector, such as pzeoSV2
(Invitrogen, San Diego, CA), and transiently transfected into Chinese Hamster Ovary
(CHO) cells using known methods, such as the transfection reagent FuGENE 6
(Boehringer-Mannheim) and the transfection protocol provided in the product insert.

The transfected CHO cells are seeded into the 96-well microplates
from the FlashPlate® assay kit, which are coated with solid scintillant to which
antisera to cAMP has been bound. For a control, some wells are seeded with wild
type (untransfected) CHO cells. Other wells on the plate receive various amounts of
cAMP standard solution for use in creating a standard curve.

One or more test compounds are added to the cells in each well, with
water and/or compound-free media/diluent serving as a control. After treatment,
cAMP is allowed to accumulate in the cells for exactly 15 minutes at room
temperature. The assay is terminated by the addition of lysis buffer containing
[125I]-labelled cAMP, and the plate is counted using a Packard Topcount™ 96-well
microplate scintillation counter. Unlabelled cAMP from the lysed cells (or from
standards) competes with the fixed amounts of [125I]-cAMP for antibody bound to the
plate. A standard curve is constructed, and cAMP values for the unknowns are
obtained by interpolation. Changes in intracellular cAMP level of the cells in
response to exposure to a test compound are indicative of GPCR polypeptide
modulating activity. Modulators that act as agonists at receptors which couple to the
Gs subtype of G-proteins will stimulate production of cAMP, leading to a measurable
3-10 fold increase. Receptor agonists which couple to the Gi/o subtype of G-proteins
will inhibit forskolin-stimulated cAMP production, leading to a measurable decrease
of 50-100%. Modulators that act as inverse agonists will reverse these effects at
receptors that are either constitutively active or activated by known agonists.

B. Aequorin Assays

In another assay, cells (e.g., CHO cells) are transiently co-transfected
with both a GPCR expression construct and a construct that encodes the photoprotein
apoaequorin. In the presence of the cofactor coelenterazine, apoaequorin will emit a
measurable luminescence that is proportional to the amount of intracellular
(cytoplasmic) free calcium. [See generally Cobbold P.H. and Lee, J.A.C. "Aequorin
measurements of cytoplasmic free calcium." In: McCormack J.G. and Cobbold P.H.,
eds., Cellular Calcium: A Practical Approach. Oxford: IRL Press (1991); Stables et
al., Analytical Biochemistry, 252: 115-26 (1997); and Haugland, R.P. Handbook of
Fluorescent Probes and Research Chemicals. Sixth edition. Eugene OR: Molecular
Probes (1996).]

In one exemplary assay, a GPCR-encoding polynucleotide is subcloned
into the commercial expression vector pzeoSV2 (Invitrogen, San Diego, CA) and
transiently co-transfected along with a construct that encodes the photoprotein
apoaequorin (Molecular Probes, Eugene, OR) into CHO cells using the transfection
reagent FuGENE 6 (Boehringer-Mannheim) and the transfection protocol provided in
the product insert.

The cells are cultured for 24 hours at 37°C in αMEM (Gibco/BRL,
Gaithersburg, MD) supplemented with 10% FBS, 2 mM glutamine, 10 U/ml of
penicillin and 10 µg/ml of streptomycin. Subsequently, the media is changed to
serum-free αMEM containing 5 µM coelenterazine (Molecular Probes, Eugene, OR),
and the cells are cultured for two additional hours at 37°C. Cells are then detached
from the plate using VERSENE (Gibco/BRL), washed and resuspended at 2 x 10'
cells/ml in serum-free αMEM.

Dilutions of candidate GPCR modulator drugs are prepared in serum-free
αMEM and dispensed into wells of an opaque 96-well assay plate, 50 µl/well.
Plates are loaded onto an MLX microtiter plate luminometer (Dynex Technologies,
Inc., Chantilly, VA). The instrument is programmed to dispense 50 µl of cell
suspension into each well, one well at a time, and immediately read luminescence for
15 seconds. Dose-response curves for the modulator candidates are constructed using
the area under the curve for each light signal peak. Data are analyzed with
SlideWrite, using the equation for 1-site ligand, and EC50 values are obtained.
Changes in luminescence caused by the drugs are considered indicative of modulatory
activity. Modulators that act as receptor agonists which couple to the Gq subtype of
G-proteins give an increase in luminescence of up to 100 fold. Modulators that act as
inverse agonists will reverse this effect at receptors that are either constitutively active
or activated by known agonists.
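
The analysis described above (area under each light peak, then a 1-site ligand fit) can be sketched as below. This is an illustrative stand-in and not the SlideWrite procedure: the function names, the simulated data, and the grid-scan fitting method are all assumptions made for the example.

```python
def area_under_curve(times, signals):
    """Trapezoidal area under one luminescence trace (e.g., a 15 s read)."""
    return sum(
        (t1 - t0) * (s0 + s1) / 2.0
        for t0, t1, s0, s1 in zip(times, times[1:], signals, signals[1:])
    )


def one_site(conc, top, ec50):
    """One-site ligand model: response = top * conc / (ec50 + conc)."""
    return top * conc / (ec50 + conc)


def fit_one_site(concs, responses, ec50_grid):
    """Least-squares fit by scanning candidate EC50 values.

    For a fixed EC50 the best `top` has a closed form, so only EC50 is
    scanned. Returns (top, ec50).
    """
    best = None
    for ec50 in ec50_grid:
        x = [c / (ec50 + c) for c in concs]
        top = sum(xi * r for xi, r in zip(x, responses)) / sum(xi * xi for xi in x)
        sse = sum((r - top * xi) ** 2 for xi, r in zip(x, responses))
        if best is None or sse < best[0]:
            best = (sse, top, ec50)
    return best[1], best[2]


# Hypothetical data: AUC responses simulated from the model itself.
concs_nM = [1.0, 10.0, 100.0, 1000.0, 10000.0]
auc_responses = [one_site(c, top=100.0, ec50=100.0) for c in concs_nM]
grid = [10 ** (k / 10) for k in range(0, 41)]  # 1 nM to 10 µM, log-spaced
top_fit, ec50_fit = fit_one_site(concs_nM, auc_responses, grid)
```

In practice each response would be the value returned by `area_under_curve` for one well, and a finer grid or a dedicated optimizer would replace the coarse scan.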

C. Luciferase Reporter Gene Assay 

The photoprotein luciferase provides another useful tool for assaying
for modulators of GPCR activity. Cells (e.g., CHO cells or COS 7 cells) are
transiently co-transfected with both a GPCR expression construct (e.g., GPCR-encoding
sequence in pzeoSV2 (Invitrogen, San Diego, CA)) and a reporter construct
which includes a gene for the luciferase protein downstream from a transcription
factor, either cAMP-response element (CRE), AP-1, or NF kappa B. Agonist binding
to receptors coupled to the Gs subtype of G-proteins leads to increases in cAMP,
activating the CRE transcription factor and resulting in expression of the luciferase
gene. Agonist binding to receptors coupled to the Gq subtype of G-protein leads to
production of diacylglycerol that activates protein kinase C. As a result, the AP-1 or
NF kappa B transcription factors are activated which stimulate expression of the
luciferase gene. Expression levels of luciferase reflect the activation status of the
signaling events. [See generally George et al., Journal of Biomolecular Screening,
2(4): 235-40 (1997); and Stratowa et al., Current Opinion in Biotechnology, 6: 574-81
(1995).] Luciferase activity may be quantitatively measured using, e.g., luciferase
assay reagents that are commercially available from Promega (Madison, WI).

In one exemplary assay, CHO cells are plated in 24-well culture dishes
at a density of 100,000 cells/well one day prior to transfection and cultured at 37°C in
αMEM (Gibco/BRL, Gaithersburg, MD) supplemented with 10% FBS, 2 mM
glutamine, 10 U/ml penicillin and 10 µg/ml streptomycin. Cells are transiently
co-transfected with both a GPCR expression construct and a reporter construct
containing the luciferase gene. The reporter plasmids CRE-luciferase, AP-1-luciferase
and NF kappa B-luciferase may be purchased from Stratagene (La Jolla, CA).
Transfections are performed using FuGENE 6 transfection reagent
(Boehringer-Mannheim), and the protocol provided in the product insert. Cells
transfected with the reporter construct alone are used as a control. Twenty-four hours
after transfection, cells are washed once with phosphate buffered saline (PBS)
pre-warmed to 37°C. Serum-free αMEM is then added to the cells either alone
(control) or with one or more candidate modulators and the cells are incubated at
37°C for five hours. Thereafter, cells are washed once with ice cold PBS and lysed by
the addition of 100 µl of lysis buffer/well (from luciferase assay kit, Promega,
Madison, WI). After incubation for 15 minutes at room temperature, 15 µl of the
lysate is mixed with 50 µl substrate solution (Promega) in an opaque white 96-well
plate, and the luminescence is read immediately on a Wallac model 1450 MicroBeta
scintillation and luminescence counter (Wallac Instruments, Gaithersburg, MD).

Differences in luminescence in the presence versus the absence of a
candidate modulator compound are indicative of modulatory activity. Receptors that
are either constitutively active or activated by agonists give a 3-20 fold stimulation of
luminescence compared to cells transfected with the reporter gene alone. Modulators
that act as inverse agonists will reverse this effect.

D. Intracellular Calcium Measurement using FLIPR 

Changes in intracellular calcium levels are another recognized
indicator of G protein-coupled receptor activity, and such assays can be employed to
evaluate modulators of GPCR activity. For example, CHO cells stably transfected
with a GPCR expression vector are plated at a density of 4 x 10*^ cells/well in Packard
black-walled 96-well plates specially designed to isolate fluorescent signal to
individual wells. The cells are incubated for 60 minutes at 37°C in modified
Dulbecco's PBS (D-PBS) containing 36 mg/L of pyruvate and 1 g/L of glucose with
the addition of 1% FBS and one of four calcium indicator dyes (Fluo-3™ AM, Fluo-4™
AM, Calcium Green™-1 AM, or Oregon Green™ 488 BAPTA-1 AM) at a
concentration of 4 µM. Plates are washed once with modified D-PBS without 1%
FBS and incubated for 10 minutes at 37°C to remove residual dye from the cellular
membrane. In addition, a series of washes with modified D-PBS without 1% FBS is
performed immediately prior to activation of the calcium response.

Calcium response is initiated by the addition of one or more candidate
receptor agonist compounds, calcium ionophore A23187 (10 µM), or ATP (4 µM).
Fluorescence is measured by Molecular Devices' FLIPR with an argon laser,
excitation at 488 nm. [See, e.g., Kuntzweiler et al., Drug Development Research,
44(1): 14-20 (1998).] The F-stop for the detector camera was set at 2.5 and the length
of exposure was 0.4 milliseconds. Basal fluorescence of cells was measured for 20
seconds prior to addition of agonist, ATP, or A23187, and was subtracted from the
response signal. The calcium signal is measured for approximately 200 seconds,
taking readings every two seconds. Calcium ionophore and ATP increase the calcium
signal 200% above baseline levels. In general, activated orphan GPCRs increase the
calcium signal approximately 10-15% above baseline signal.
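
The baseline subtraction described above can be illustrated with a short sketch. The function name and the example trace are hypothetical, and expressing the peak as a percentage of the mean basal reading is one reasonable reading of "above baseline", not a documented FLIPR calculation.

```python
def percent_over_baseline(trace, interval_s=2.0, baseline_s=20.0):
    """Peak calcium response as a percentage above basal fluorescence.

    `trace` holds readings taken every `interval_s` seconds; the first
    `baseline_s` seconds are recorded before the compound is added.
    """
    n_base = int(baseline_s / interval_s)
    basal = sum(trace[:n_base]) / n_base
    peak = max(trace[n_base:]) - basal  # basal-subtracted response
    return 100.0 * peak / basal


# Hypothetical trace: ten basal readings, then the response after addition.
example_trace = [1000.0] * 10 + [1050.0, 1120.0, 1100.0, 1040.0]
response_pct = percent_over_baseline(example_trace)
```

With these invented readings the response falls in the 10-15% range described for activated orphan GPCRs.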

E. Mitogenesis Assay 
In mitogenesis assays, the ability of candidate modulators to induce or
inhibit GPCR-mediated cell growth is determined. [See, e.g., Lajiness et al., Journal
of Pharmacology and Experimental Therapeutics, 267(3): 1573-81 (1993).]

For example, CHO cells stably expressing a GPCR are seeded into 96-well
plates at a density of 5000 cells/well and grown at 37°C in αMEM supplemented
with 10% fetal calf serum. After 48 hours, the cells are rinsed twice with serum-free
αMEM and 80 µl of fresh αMEM, or αMEM containing a known mitogen, is added




along with 20 µl αMEM containing varying concentrations of one or more test
compounds diluted in serum-free media. As controls, some wells on each plate
receive serum-free media alone, and some receive media containing 10% FBS.
Untransfected cells or cells transfected with vector alone also may serve as controls.
After culture for 16-18 hours, 1 µCi/well of [3H]-thymidine (2
Ci/mmol; cpm) is added to the wells and cells are incubated for an additional 2 hours
at 37°C. The cells are trypsinized and harvested onto filter mats with a cell harvester
(Tomtec) and the filters are counted in a Betaplate counter. The incorporation of
[3H]-thymidine in serum-free test wells is compared to the results achieved in cells
stimulated with serum. Use of multiple concentrations of test compounds permits
creation and analysis of dose-response curves using the non-linear, least squares fit
equation: A = B x [C / (D + C)] + G, where A is the percent of serum stimulation; B is
the maximal effect minus baseline; C is the EC50; D is the concentration of the
compound; and G is the maximal effect. Parameters B, C and G are determined by
Simplex optimization.

Agonists that bind to the receptor are expected to increase
[3H]-thymidine incorporation into cells, showing up to 80% of the response to serum.
Antagonists that bind to the receptor will inhibit the stimulation seen with a known
agonist by up to 100%.


F. [35S]GTPγS Binding Assay

Because G protein-coupled receptors signal through intracellular "G
proteins" whose activity involves GTP/GDP binding and hydrolysis, another
indicator of GPCR modulator activity is measuring binding of the non-hydrolyzable
GTP analog [35S]GTPγS in the presence and absence of putative modulators. [See,
e.g., Kowal et al., Neuropharmacology, 37: 179-87 (1998).]

In one exemplary assay, cells stably transfected with a GPCR
expression vector are grown in 10 cm dishes to subconfluence, rinsed once with 5 ml
of ice cold Ca2+/Mg2+ free PBS, and scraped into 5 ml of the same buffer. Cells are
pelleted by centrifugation (500 x g, 5 minutes), resuspended in TEE buffer (25 mM
Tris, 5 mM EDTA, 5 mM EGTA, pH 7.5) and frozen in liquid nitrogen. After
thawing, the cells are homogenized using a dounce (one ml TEE per plate of cells),
and centrifuged at 1,000 x g for 5 minutes to remove nuclei and unbroken cells.

The homogenate supernatant is centrifuged at 20,000 x g for 20
minutes to isolate the membrane fraction. The membrane pellet is then washed once
with TEE and resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl,
10 mM MgCl2, 1 mM EDTA). The resuspended membranes can be frozen in liquid
nitrogen and stored at -lO^'C until use.

Aliquots of cell membranes prepared and stored as described above
are thawed, homogenized, and diluted to a concentration of 10-50 µg/ml in
buffer containing 20 mM HEPES, 10 mM MgCl2, 1 mM EDTA, 120 mM NaCl, 10
µM GDP, and 0.2 mM ascorbate. In a final volume of 90 µl, homogenates are
incubated with varying concentrations of putative modulator compounds or 100 µM
GTP for 30 minutes at 30°C and then placed on ice. To each sample, 10 µl guanosine
5'-O-(3-[35S]thio) triphosphate (NEN, 1200 Ci/mmol), ([35S]-GTPγS), is added to a
final concentration of 100-200 pM. Samples are incubated at 30°C for an additional
30 minutes. The reaction is then stopped by the addition of 1 ml of 10 mM HEPES
and 10 mM MgCl2 (pH 7.4), at 4°C, and filtration.

Samples are filtered over Whatman GF/B filters. These filters are
washed with 20 ml ice-cold 10 mM HEPES (pH 7.4) and 10 mM MgCl2 and counted
by liquid scintillation spectroscopy. Nonspecific binding of [35S]-GTPγS is measured
in the presence of 100 µM GTP and subtracted from the total. Compounds are
selected that modulate the amount of [35S]-GTPγS binding in the cells, compared to
untransfected control cells. Activation of receptors by agonists gives up to a five-fold
increase in [35S]GTPγS binding. This response is blocked by antagonists.
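
The subtraction of nonspecific binding described above can be sketched as follows. The function names and count values are hypothetical and serve only to show the arithmetic.

```python
def specific_binding(total_cpm, nonspecific_cpm):
    """Specific binding: total counts minus counts measured with excess GTP."""
    return total_cpm - nonspecific_cpm


def fold_over_basal(agonist_total, basal_total, nonspecific):
    """Agonist-stimulated specific binding relative to basal specific binding."""
    return (specific_binding(agonist_total, nonspecific)
            / specific_binding(basal_total, nonspecific))


# Hypothetical filter counts (cpm)
NONSPECIFIC = 200.0     # measured in the presence of 100 µM GTP
BASAL_TOTAL = 1200.0    # membranes without modulator
AGONIST_TOTAL = 4200.0  # membranes with candidate agonist

fold = fold_over_basal(AGONIST_TOTAL, BASAL_TOTAL, NONSPECIFIC)
```

The resulting fold value can be compared with the up to five-fold increase expected for agonists.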


G. MAP Kinase Activity Assay 

Evaluation of MAP kinase activity in cells expressing a GPCR provides
another assay to identify modulators of GPCR activity. [See, e.g., Lajiness et al.,
Journal of Pharmacology and Experimental Therapeutics, 267(3): 1573-81 (1993);
and Boulton et al., Cell, 65: 663-75 (1991).]
In one embodiment, CHO cells stably transfected with a GPCR-encoding
polynucleotide are seeded into 6 well plates at a density of 70,000 cells/well
48 hours prior to the assay. During this time, the cells are cultured at 37°C in αMEM
media supplemented with 10% FBS, 2 mM glutamine, 10 U/ml penicillin and 10
µg/ml streptomycin. The cells are serum starved for 1-2 hours prior to the addition of
stimulants.

For the assay, the cells are treated with media alone or media
containing a putative agonist or phorbol ester-myristoyl acetate (PMA) as a positive
control. After treatment, cells are incubated at 37°C for varying times. To stop the
reaction, the plates are placed on ice, the media is aspirated, and the cells are rinsed
with 1 ml of ice-cold PBS containing 1 mM EDTA. Thereafter, 200 µl cell lysis
buffer (12.5 mM MOPS (pH 7.3), 12.5 mM β-glycerophosphate, 7.5 mM MgCl2, 0.5
mM EGTA, 0.5 mM sodium vanadate, 1 mM benzamidine, 1 mM dithiothreitol, 10
µg/ml leupeptin, 10 µg/ml aprotinin, 2 µg/ml pepstatin A, and 1 µM okadaic acid) is
added to the cells. The cells are scraped from the plates and homogenized by 10
passages through a 23 3/4 gauge needle. The cytosol fraction is prepared by
centrifugation at 20,000 x g for 15 minutes.

Aliquots (5-10 µl containing 1-5 µg protein) of cytosols are mixed with
1 mM MAPK Substrate Peptide (APRTPGGRR (SEQ ID NO: 25); Upstate
Biotechnology, Inc., N.Y.) and 50 µM [γ-32P]ATP (NEN, 3000 Ci/mmol) diluted to a
final specific activity of ~2000 cpm/pmol in a total volume of 25 µl. The samples are
incubated for 5 minutes at 30°C, and reactions are stopped by spotting 20 µl on 2 cm2
of Whatman P81 phosphocellulose paper. The filter squares are washed in 4 changes
of 1% H3PO4, and the squares are counted by liquid scintillation spectroscopy.
Equivalent cytosolic extracts are incubated without MAPK substrate peptide, and the
cpm from these samples are subtracted from the matched samples with the substrate
peptide. The cytosolic extract from each well is used as a separate point. Protein
concentrations are determined by a dye binding protein assay (Bio-Rad). Agonist
activation of the receptor is expected to result in up to a five fold increase in MAPK
enzyme activity. This increase is blocked by antagonists.
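
The count-to-activity conversion implied above can be sketched as follows. The function name, the example counts, and in particular the correction for spotting 20 µl of a 25 µl reaction are assumptions added for illustration and are not stated in the protocol.

```python
def mapk_activity(cpm_with, cpm_without, cpm_per_pmol,
                  minutes, protein_ug, fraction_spotted):
    """Return pmol phosphate transferred per minute per mg protein.

    cpm_without is the matched sample lacking substrate peptide.
    fraction_spotted scales the counted portion up to the whole reaction
    (an assumption for this sketch).
    """
    net_cpm = cpm_with - cpm_without
    pmol = net_cpm / cpm_per_pmol / fraction_spotted
    return pmol / minutes / (protein_ug / 1000.0)


# Hypothetical counts; 2000 cpm/pmol, 5 minutes, and 20 of 25 µl follow the text.
activity = mapk_activity(
    cpm_with=9000.0,
    cpm_without=1000.0,
    cpm_per_pmol=2000.0,
    minutes=5.0,
    protein_ug=2.0,
    fraction_spotted=20.0 / 25.0,
)
```

Dividing the activity of agonist-treated cells by that of cells given media alone would give the fold increase referred to in the text.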
H. [3H]Arachidonic Acid Release

The activation of GPCR's also has been observed to potentiate
arachidonic acid release in cells, providing yet another useful assay for modulators of
the activity of GPCR's of the present invention. [See, e.g., Kanterman et al.,
Molecular Pharmacology, 39: 364-9 (1991).] For example, CHO cells that are stably
transfected with a GPCR expression vector are plated in 24-well plates at a density of
15,000 cells/well and grown in αMEM media supplemented with 10% FBS, 2 mM
glutamine, 10 U/ml penicillin and 10 µg/ml streptomycin for 48 hours at 37°C before
use. Cells of each well are labeled by incubation with [3H]arachidonic acid
(Amersham Corp., 210 Ci/mmol) at 0.5 µCi/ml in 1 ml αMEM supplemented with 10
mM HEPES (pH 7.5), and 0.5% fatty-acid-free bovine serum albumin for 2 hours at
37°C. The cells are then washed twice with 1 ml of the same buffer.

Candidate modulator compounds are added in 1 ml of the same buffer,
either alone or containing 10 µM ATP (adenosine 5'-triphosphate) and the cells are
incubated at 37°C for 30 minutes. Buffer alone and mock transfected cells are used as
controls. Samples (0.5 ml) from each well are counted by liquid scintillation
spectroscopy. Agonists which activate the receptor will lead to potentiation of the
ATP-stimulated release of [3H]-arachidonic acid. This potentiation is blocked by
antagonists.


I. Extracellular Acidification Rate 

In yet another assay, the effects of putative modulators of GPCR
activity are assayed by monitoring extracellular changes in pH induced by the putative
modulators. [See, e.g., Dunlop et al., Journal of Pharmacological and Toxicological
Methods, 40(1): 47-55 (1998).]

CHO cells transfected with a GPCR expression vector are seeded into
12-mm capsule cups (Molecular Devices Corp.) at 4 x 10^ cells/cup in αMEM
supplemented with 10% FBS, 2 mM L-glutamine, 10 units/ml penicillin, and 10 µg/ml
streptomycin. The cells are incubated in this media at 37°C in 5% CO2 for 24 hours.

Extracellular acidification rates are measured using a Cytosensor
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the
sensor chambers of the microphysiometer and the chambers are perfused with running
buffer (bicarbonate free αMEM supplemented with 4 mM L-glutamine, 10 units/ml
penicillin, 10 µg/ml streptomycin, 26 mM NaCl) at a flow rate of 100 µl/min.
Agonists or other agents are diluted into the running buffer and perfused through a
second fluid path. During each 60 second pump cycle, the pump is run for 38 seconds
and is off for the remaining 22 seconds. The pH of the running buffer in the sensor
chamber is recorded during the cycle from 43-58 seconds, and the pump is re-started
at 60 seconds to start the next cycle. The rate of acidification of the running buffer
during the recording time is calculated by the Cytosoft program. Changes in the rates
of acidification are calculated by subtracting the baseline value (the average of 4 rate
measurements immediately before addition of modulator candidates) from the highest
rate measurement obtained after addition of a modulator candidate. The selected
instrument detects 61 mV/pH unit. Modulators that act as agonists at the receptor
result in an increase in the rate of extracellular acidification as compared to the rate in
the absence of agonist. This response is blocked by modulators which act as
antagonists at the receptor.
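
The baseline subtraction described above (average of the 4 rate measurements before addition, subtracted from the highest rate afterwards) can be sketched as follows. The function name, the example rates, and their units are hypothetical; this is not the Cytosoft calculation itself.

```python
def acidification_change(rates, addition_index):
    """Highest post-addition rate minus the mean of the 4 preceding rates.

    `rates` is one acidification rate per pump cycle; `addition_index` is
    the index of the first cycle after the modulator candidate is added.
    """
    if addition_index < 4:
        raise ValueError("need four rate measurements before addition")
    baseline = sum(rates[addition_index - 4:addition_index]) / 4.0
    return max(rates[addition_index:]) - baseline


# Hypothetical per-cycle rates (arbitrary units); compound added at cycle 4.
rates = [100.0, 102.0, 98.0, 100.0, 110.0, 130.0, 120.0]
change = acidification_change(rates, addition_index=4)
```

A positive change would be consistent with agonist activity, and a change abolished by a second compound would be consistent with antagonism.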

EXAMPLE 7 
Luciferase Reporter Gene Assays 
Luciferase reporter gene assays (essentially as described in Example 6)
were carried out to measure signaling activity of the GPCR receptors when coupled to
Gs, Gi or Gq G-proteins. Activation of Gs coupled receptors results in stimulation of
intracellular cAMP production which leads to activation of the transcription factor
cyclic AMP response element (CRE). Therefore activation of Gs coupled receptors
can be detected by measuring transcription and translation of the reporter gene
CRE-luciferase. The level of expression of the CRE reporter gene is dependent on the
intracellular level of cAMP. Similarly, activation of Gs, Gi or Gq coupled receptors
will result in activation of the AP-1 transcription factor. Expression of the AP-1
transcription factor can be attributed to changes in cAMP levels and/or increases in
the levels of intracellular calcium and therefore can be an indication of G-protein
coupled receptor activation.



CHO 10001A cells (Gottesman et al., Somatic Cell Genetics 6: 45-61,
1980) were maintained in Minimal Essential Medium (MEM) supplemented with
10% FBS (Hyclone Laboratories, Inc., Logan, Utah) at 37°C in an atmosphere of 5%
CO2. The cells were split 1:5 twice a week for maintenance. Plasmids used in the
experiments were propagated in E. coli strain DH5 (Gibco BRL) and purified using
the Qiagen Maxi-prep plasmid purification system according to the manufacturer's
instructions.

One day prior to transfection, 1x10^ CHO cells/well were plated on 24
well culture plates and allowed to adhere overnight. Each well on the plate was
transfected with 0.5 µg of either AP-1 luciferase (Stratagene, La Jolla, CA) or CRE
luciferase plasmid alone or in combination with 0.125 µg of a GPCR plasmid (GPCR
DNA inserted into the pCDNA3 vector from Invitrogen). Cells were transiently
transfected with the commercially available transfection reagent FUGENE-6
according to the manufacturer's instructions (Boehringer Mannheim, Indianapolis, IN).

Twenty-four hours after transfection, the cells were washed in PBS
pre-warmed to 37°C. Agonists and antagonists were diluted in pre-warmed
serum-free MEM, added to the transfected cells and incubated at 37°C, 5% CO2 for 5 hours.
Subsequently, the cells were washed once in ice cold PBS and lysed with the addition
of 100 µl of lysis buffer (Promega) to each well. After a 15 minute incubation at room
temperature, luciferase reporter gene activation was analyzed with the Luciferase
Assay Reagents commercially available from Promega (Madison, WI). An aliquot of
lysate (15 µl) was mixed with 50 µl of substrate solution in an opaque white 96 well
plate. The luminescence from the plate was read in a Wallac 1450 MicroBeta
scintillation and luminescence counter (Wallac Instruments, Gaithersburg, MD).
Constitutive GPCR activity was calculated as activity measured in GPCR transfected
cells divided by activity measured in control cells (control cells = luciferase-transfected
cells in the absence of GPCR plasmid). The measurements of GPCR
constitutive activity (as a percentage of control measurements) are summarized in the
table below:



GPCR        CRE Activity    AP-1 Activity
CON193      [illegible]     100%
CON197      [illegible]     100%
CON198      178%            146%
CON203      100%            468%
CON215      173%            307%
CON222      100%            100%
CON202      135%            336%
CON166      115%            100%
CON217      211%            100%
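
The constitutive activity calculation used for this table (GPCR-transfected signal divided by the reporter-only control, expressed as a percentage) can be sketched as follows. The function names and luminescence readings are hypothetical, and averaging replicate wells is an assumption of the sketch rather than a stated step.

```python
def mean(values):
    """Arithmetic mean of replicate wells."""
    return sum(values) / len(values)


def constitutive_activity_percent(gpcr_wells, control_wells):
    """Luminescence of GPCR-transfected cells as a percentage of control.

    Control cells carry the luciferase reporter plasmid without GPCR plasmid.
    """
    return 100.0 * mean(gpcr_wells) / mean(control_wells)


# Hypothetical luminescence counts from replicate wells
control = [1000.0, 1100.0, 900.0]  # reporter plasmid alone
gpcr = [3000.0, 3200.0, 3100.0]    # reporter plus GPCR plasmid

activity_pct = constitutive_activity_percent(gpcr, control)
```

A value of 100% indicates no signal above the reporter-only control, while values above 100% suggest constitutive receptor activity on that reporter.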



These results provide useful information for designing screening assays
to identify molecules (natural or artificial) that activate or inhibit the GPCR's of the
invention. For example, compound libraries can be screened using the AP-1
luciferase (for CON198, CON203, CON215, or CON202) or the CRE-luciferase assay
(for CON193, CON197, CON198, CON215, CON202, and CON166) to identify
compounds which increase the signaling activity in GPCR polypeptide expressing
cells as compared to receptor negative cells. The identified compounds may be useful
for predicting endogenous ligands for the GPCR polypeptides, for measuring the
physiological effects of GPCR activation in animal models, and for designing
therapeutics to modulate GPCR activity to treat disease states.

EXAMPLE 8 
Chromosomal Localization of GPCR 

The following example pertains to chromosomal localization of GPCR
genes of the present invention (e.g., CON193, CON166, CON103, CON203,
CON198, CON197, CON202, CON222, CON215, or CON217). The chromosomal
localization permits use of the GPCR polynucleotide sequences (including fragments
thereof) as chromosomal markers to assist with genome mapping and to provide
markers for disease states. Chromosomal localization also permits correlation of the
GPCR's of the invention with disease states in which aberrant activity of the GPCR is
implicated, especially disease states that have previously been linked (or will be linked)
with mutations, polymorphisms, chromosomal rearrangements, and other
chromosomal changes near the locus of the GPCR gene.


A. CON197

Chromosomal localization of the gene encoding CON197 (SEQ ID NO:
11) was determined using the Stanford G3 Radiation Hybrid Panel (Research
Genetics, Inc., Huntsville, AL). This panel contains 83 radiation hybrid clones of the
entire human genome as created by the Stanford Human Genome Center (Stanford,
California). PCR was carried out with each clone within the Hybrid Panel and the
results were submitted to the Stanford Human Genome Center via e-mail for
analysis (http://www.shgc.standford.edu/RH/rhserverformnew.html).

PCR reactions were carried out with the Expand Hi-Fi PCR System™
according to the manufacturer's instructions (Roche Molecular Biochemicals,
Indianapolis, IN). Primers, synthesized by Genosys Corp. (The Woodlands, TX),
were designed to generate a 140 base pair fragment of CON197-encoding DNA in the
presence of the appropriate genomic DNA. The forward primer, denoted as LW1332
(TCCTACTGTGATGAACCC; SEQ ID NO: 74), corresponded to nucleotides 396
through 413 of SEQ ID NO: 11. The reverse primer, denoted as LW1333
(CAGAAGAAGTTGTCCAGC; SEQ ID NO: 75), corresponded to the complement
of nucleotides 519 through 536 of SEQ ID NO: 11. Each reaction contained 25 ng of
DNA from a hybrid clone, 60 ng of Primer LW1332, and 60 ng of Primer LW1333
resulting in a final volume of 15 µl. The PCR reactions were carried out in a
GeneAmp 9700 PCR thermocycler (Perkin Elmer Applied Biosystems) under the
following conditions: 94°C for 3 minutes followed by 35 cycles of 94°C for 30
seconds, 52°C for 1 minute, and 72°C for 2 minutes. The PCR reactions were then
analyzed on a 2.0% agarose gel and stained with ethidium bromide. The lanes were
scored for the presence of the 140 base pair PCR product.

The G3 Hybrid Panel analysis revealed that the CON197 gene (SEQ ID
NO: 11) was localized to chromosome 14, most nearly linked to Stanford marker
SHGC-10764 with a LOD score of 9.10. The SHGC-10764 marker lies at position
IqlM.

B. CON202

Chromosomal localization of the gene encoding CON202 (SEQ ID NO:
13) was determined using the Stanford G3 Radiation Hybrid Panel (Research
Genetics, Inc., Huntsville, AL). This panel contains 83 radiation hybrid clones of the
entire human genome as created by the Stanford Human Genome Center (Stanford,
California). PCR was carried out with each clone within the Hybrid Panel and the
results were submitted to the Stanford Human Genome Center via e-mail for
analysis (http://www.shgc.standford.edu/RH/rhserverformnew.html).

PCR reactions were carried out with the Expand Hi-Fi PCR System™
according to the manufacturer's instructions (Roche Molecular Biochemicals,
Indianapolis, IN). Primers, synthesized by Genosys Corp. (The Woodlands, TX),
were designed to generate a 250 base pair fragment of CON202-encoding DNA in the
presence of the appropriate genomic DNA. The forward primer, denoted as LW1480
(GGTTCTACCTGGACTTATGG; SEQ ID NO: 70), corresponded to nucleotides 515
through 534 of SEQ ID NO: 13. The reverse primer, denoted as LW1481
(TAATGAATGAGTAAGTGCCC; SEQ ID NO: 71), corresponded to the
complement of nucleotides 745 through 764 of SEQ ID NO: 13. Each reaction
contained 25 ng of DNA from a hybrid clone, 60 ng of Primer LW1480, and 60 ng of
Primer LW1481 resulting in a final volume of 15 µl. The PCR reactions were carried
out in a GeneAmp 9700 PCR thermocycler (Perkin Elmer Applied Biosystems) under
the following conditions: 94°C for 3 minutes followed by 35 cycles of 94°C for 30
seconds, 52°C for 1 minute, and 72°C for 2 minutes. The PCR reactions were then
analyzed on a 2.0% agarose gel and stained with ethidium bromide. The lanes were
scored for the presence of the 250 base pair PCR product.
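
The relationship between the primer coordinates and the expected product size can be checked with a short sketch. The primer sequences and nucleotide positions are those given above for CON202; the function names are hypothetical, and the template sequence itself (SEQ ID NO: 13) is not reproduced here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def amplicon_length(forward_start, reverse_end):
    """Product length when the primers span forward_start..reverse_end (1-based, inclusive)."""
    return reverse_end - forward_start + 1


FORWARD_LW1480 = "GGTTCTACCTGGACTTATGG"  # nucleotides 515-534
REVERSE_LW1481 = "TAATGAATGAGTAAGTGCCC"  # complement of nucleotides 745-764

product_bp = amplicon_length(515, 764)
# Sense-strand sequence that the reverse primer anneals against
reverse_target = reverse_complement(REVERSE_LW1481)
```

The computed product size agrees with the 250 base pair fragment scored on the gel, and each primer length matches the span of nucleotide positions it is said to cover.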

The G3 Hybrid Panel analysis revealed that the CON202 gene (SEQ ID
NO: 13) was localized to chromosome 7, most nearly linked to Stanford marker
SHGC-12021 with a LOD score of 10.36. The SHGC-12021 marker lies at position
7q21. There is evidence that schizophrenia is linked to chromosome 7q22, and
therefore any genes localized to this region are candidates for disease involvement or
susceptibility. [See Ekelund et al., Human Mol. Genetics 9(7): 1049-1057 (2000);
Faraone et al., Am. J. Med. Genet., 81: 290-295 (September, 1998); and Blouin et al.,
Nat. Genet., 20: 70-73 (1998)]. The SHGC-12021 marker is proximal to 7q22 (~1
cM) and therefore may be associated with schizophrenia susceptibility.

In particular, G protein-coupled receptors, such as CON202
polypeptide, have the biochemical and functional potential to play a role in the disease
process of schizophrenia. CON202 is an attractive target for screening for ligands
(natural and synthetic) that are useful in modulating cellular processes involved in
schizophrenia. In addition, the chromosomal localization data (especially coupled
with CON202 expression patterns in the brain) identifies CON202 as a candidate for
screening healthy and affected (schizophrenia) individuals for CON202 allelic
variants, mutations, duplications, rearrangements, and other chromosomal variations
that correlate with the disease state. Variations that correlate with disease state are
useful for diagnosis of disease or disease susceptibility. CON202 constructs
containing the variations are useful for designing targeted therapeutics for treatment
of the disease (e.g., by using the assays for modulators described in preceding
examples).

C. High throughput Analysis 

The EMBL High Throughput Genome database (provided by the
European Bioinformatics Institute) was searched with GPCR nucleotide sequences to
determine chromosomal localization for CON193, CON166, CON103, CON203,
CON198, and CON215 genes. The results are summarized in the table below:
GPCR      SEQ ID NO:   Chromosome Localization   Based on Genbank Accession No.
CON193    1            11                        AC026090
CON166    3            X                         AC021992
CON103    5            2                         AC013396
CON203    7            3                         AC024886
CON198    9            11                        AC025249
CON215    17           3                         AC024886

While the present invention has been described in terms of specific
embodiments, it is understood that variations and modifications will occur to those in
the art, all of which are intended as aspects of the present invention. Accordingly,
only such limitations as appear in the claims should be placed on the invention.



Summary of Sequences:

SEQ ID NO.   Description
1            CON193 DNA
2            CON193 protein
3            CON166 DNA
4            CON166 protein
5            CON103 DNA
6            CON103 protein
7            CON203 DNA
8            CON203 protein
9            CON198 DNA
10           CON198 protein
11           CON197 DNA
12           CON197 protein
13           CON202 DNA
14           CON202 protein
15           CON222 DNA
16           CON222 protein
17           CON215 DNA
18           CON215 protein
19           CON217 DNA
20           CON217 protein
21           PCR primer LW1282 for CON193
22           PCR primer LW1283 for CON193
23           PCR primer LW1372 for CON193
24           PCR primer LW1374 for CON193
25           MAPK Substrate Peptide
26           Primer LW1248 for CON193 to generate in situ hybridization probe
27           Primer LW1249 for CON193 to generate in situ hybridization probe
28           PCR primer LW1278 for CON166
29           PCR primer LW1279 for CON166
30           PCR primer LW1405 for CON166
31           PCR primer LW1406 for CON166
32           PCR primer LW1280 for CON103
33           PCR primer LW1281 for CON103
34           PCR primer LW1385 for CON103
35           PCR primer LW1386 for CON103
36           PCR primer LW1329 for CON203
37           PCR primer LW1377 for CON203
38           PCR primer LW1387 for CON203
39           PCR primer LW1388 for CON203
40           Primer LW1314 for CON203 to generate in situ hybridization probe
41           Primer LW1315 for CON203 to generate in situ hybridization probe
42           PCR primer LW1326 for CON198
43           PCR primer LW1327 for CON198
44           PCR primer LW1415 for CON198
45           PCR primer LW1416 for CON198
46           Primer LW1308 for CON198 to generate in situ hybridization probe
47           Primer LW1309 for CON198 to generate in situ hybridization probe
48           PCR primer LW1324 for CON197
49           PCR primer LW1325 for CON197
50           Primer LW1306 for CON197 to generate in situ hybridization probe
51           Primer LW1307 for CON197 to generate in situ hybridization probe
52           PCR primer GV599 for CON202
53           PCR primer GV600 for CON202
54           PCR primer LW1482 for CON202
55           PCR primer LW148 for CON202
56           Primer LW1310 for CON202 to generate in situ hybridization probe
57           Primer LW1311 for CON202 to generate in situ hybridization probe
58           PCR primer LW1442 for CON222
59           PCR primer LW1443 for CON222
60           PCR primer LW1440 for CON222
61           PCR primer LW1441 for CON222
62           Primer LW1472 for CON222 to generate in situ hybridization probe
63           Primer LW1473 for CON222 to generate in situ hybridization probe
64           Primer LW1411 for CON215 to generate in situ hybridization probe
65           Primer LW1412 for CON215 to generate in situ hybridization probe
66           PCR primer LW1448 for CON217
67           PCR primer LW1449 for CON217
68           Primer LW217A for CON217 to generate in situ hybridization probe
69           Primer LW218B for CON217 to generate in situ hybridization probe
70           Primer LW1480 for CON202 chromosomal localization
71           Primer LW1481 for CON202 chromosomal localization
72           Primer CON103a for CON103 to generate in situ hybridization probe
73           Primer CON103b for CON103 to generate in situ hybridization probe
74           Primer LW1332 for CON197 chromosomal localization
75           Primer LW1333 for CON197 chromosomal localization



B-30250




C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a
person who makes a request for the furnishing of those samples.

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)



LegalStar 1997. Form PCTM5






Applicant's or agent's file reference number 28341/6276P
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 90; 98, line 10-14; 15

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30250

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or
the corresponding information concerning the patent in Norway or until the date on which the application has been refused
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway).

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number 28341/6276P
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 91; 98, line 10-14; 16

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30248

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a
person who makes a request for the furnishing of those samples.

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 91; 98, line 10-14; 16

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30248

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or
the corresponding information concerning the patent in Norway or until the date on which the application has been refused
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway).

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number 28341/6276P
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 92; 98, line 10-14; 17

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30247

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a
person who makes a request for the furnishing of those samples.

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 92; 98, line 10-14; 17

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30247

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or
the corresponding information concerning the patent in Norway or until the date on which the application has been refused
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway).

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number 28341/6276P
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 93; 98, line 10-14; 18

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30254

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a
person who makes a request for the furnishing of those samples.

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)







Applicant's or agent's file reference number 28341/6276P
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 93; 98, line 10-14; 18

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30254

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or
the corresponding information concerning the patent in Norway or until the date on which the application has been refused
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway).

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number 28341/6276P
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 94; 98, line 3-7; 19

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30252

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a
person who makes a request for the furnishing of those samples.

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file 

reference number 2834 de- 



International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 94; 98, line 3-7; 19

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30252

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or
the corresponding information concerning the patent in Norway or until the date on which the application has been refused
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway).

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 94; 98, line 27-31; 20

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30251

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a
person who makes a request for the furnishing of those samples.

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file reference number 28341/6276P
International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 94; 98, line 27-31; 20

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30251

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or
the corresponding information concerning the patent in Norway or until the date on which the application has been refused
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway).

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file , 
reference number 2834* 



International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 95; 98, line 19-23; 21

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30253

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [X]

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a
person who makes a request for the furnishing of those samples.

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's file 
reference number 28341, 



International application No. To Be Determined

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 95; 98, line 19-23; 21

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [X]

Name of depositary institution
Agricultural Research Service Culture Collection

Address of depositary institution (including postal code and country)
National Center for Agricultural Utilization Research
Agricultural Research Service, U.S. Department of Agriculture
1815 North University Street, Peoria, Illinois 61604 U.S.A.

Date of deposit
18 January 2000

Accession Number
B-30253

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or
the corresponding information concerning the patent in Norway or until the date on which the application has been refused
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway).

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g.,
"Accession Number of Deposit")

For receiving Office use only
This sheet was received with the international application
Authorized officer

For International Bureau use only
This sheet was received by the International Bureau on:
Authorized officer

Form PCT/RO/134 (July 1992)






Applicant's or agent's flic ' 
i-cfcrencc nurtilKr 28341/ti:^rOK 



International apptirafinn t 
To Be Determin 



INDICATIONS RELATING TO A DEPOSITED MICROORGANlSiM 

(PCT Rule \3bis) 



A. The indications made bciow relate to the microorganism referred to In the description 
on page 96; 98 . ''"C 11-1 S; 22 



B. IDENTIFICATION OF DEPOSIT p^^j^er deposits are idcntiHcd on an additional sheet X 



Name of depositary institution 

Agricultural Research Service Culture Collection 



Address of depositary institution (including postal code and country) 
National Center for Agricultural Utilization Research 
Agricultural Research Service, U.S. Department of Agriculture 
1815 North University Street, Peoria. Illinois 61604 U.S. A. 



Date of dcposil 
18 January 2000 



Accession Number 
B-30257 



C. ADDITIONAL INDICATIONS (leave blank if not applicable)    This information is continued on an additional sheet [X] 

When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory 
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent 
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been 
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a 
person who makes a request for the furnishing of those samples. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., 
"Accession Number of Deposit") 



For receiving Office use only 

This sheet was received with the international application 

Authorized officer 



For International Bureau use only 



This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 






Applicant's or agent's file reference number 28341/[illegible] 

International application No. 
To Be Determined 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 96; 98, line 11-15; 22 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 

Agricultural Research Service Culture Collection 



Address of depositary institution (including postal code and country) 
National Center for Agricultural Utilization Research 
Agricultural Research Service, U.S. Department of Agriculture 
1815 North University Street, Peoria, Illinois 61604 U.S.A. 



Date of deposit 
18 January 2000 


Accession Number 
B-30257 


C. ADDITIONAL INDICATIONS (leave blank if not applicable)    This information is continued on an additional sheet [X] 

In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the 
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or 
the corresponding information concerning the patent in Norway or until the date on which the application has been refused 
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g. 
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 



For International Bureau use only 

This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 






Applicant's or agent's file reference number 28341/[illegible] 

International application No. 
To Be Determined 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 97; 98, line 4-8; 23 

B. IDENTIFICATION OF DEPOSIT    Further deposits are identified on an additional sheet [X] 



Name of depositary institution 

Agricultural Research Service Culture Collection 



Address of depositary institution (including postal code and country) 
National Center for Agricultural Utilization Research 
Agricultural Research Service, U.S. Department of Agriculture 
1815 North University Street, Peoria, Illinois 61604 U.S.A. 



Date of deposit 
18 January 2000 


Accession Number 
B-30255 


C. ADDITIONAL INDICATIONS (leave blank if not applicable)    This information is continued on an additional sheet [X] 



When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory 
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent 
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been 
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a 
person who makes a request for the furnishing of those samples. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., 
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 

Authorized officer 

For International Bureau use only 

This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 







Applicant's or agent's file reference number 28341/[illegible] 

International application No. 
To Be Determined 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 97; 98, line 4-8; 23 

B. IDENTIFICATION OF DEPOSIT    Further deposits are identified on an additional sheet [X] 



Name of depositary institution 

Agricultural Research Service Culture Collection 



Address of depositary institution (including postal code and country) 
National Center for Agricultural Utilization Research 
Agricultural Research Service, U.S. Department of Agriculture 
1815 North University Street, Peoria, Illinois 61604 U.S.A. 



Date of deposit 
18 January 2000 

Accession Number 
B-30255 

C. ADDITIONAL INDICATIONS (leave blank if not applicable)    This information is continued on an additional sheet 



In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the 
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or 
the corresponding information concerning the patent in Norway or until the date on which the application has been refused 
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., 
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 

For International Bureau use only 

This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 






Applicant's or agent's file reference number 28341/[illegible] 

International application No. 
To Be Determined 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 98, line 1-3; 24 

B. IDENTIFICATION OF DEPOSIT    Further deposits are identified on an additional sheet [X] 



Name of depositary institution 

Agricultural Research Service Culture Collection 



Address of depositary institution (including postal code and country) 
National Center for Agricultural Utilization Research 
Agricultural Research Service, U.S. Department of Agriculture 
1815 North University Street, Peoria, Illinois 61604 U.S.A. 



Date of deposit 
18 January 2000 



Accession Number 
B-30256 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



When designating Australia, in accordance with regulation 3.25 of the Patents Regulations (Australia Statutory 
Rules 1991 No. 71), samples of materials deposited in accordance with the Budapest Treaty in relation to this Patent 
Request are only to be provided before: the patent is granted on the application; or the application has lapsed or been 
withdrawn or refused; to a person who is: a skilled addressee without an interest in the invention; and nominated by a 
person who makes a request for the furnishing of those samples. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., 
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



For International Bureau use only 



This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 





Applicant's or agent's file reference number 28341/[illegible] 

International application No. 
To Be Determined 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 98, line 1-3; 24 

B. IDENTIFICATION OF DEPOSIT    Further deposits are identified on an additional sheet [X] 

Name of depositary institution 

Agricultural Research Service Culture Collection 



Address of depositary institution (including postal code and country) 
National Center for Agricultural Utilization Research 
Agricultural Research Service, U.S. Department of Agriculture 
1815 North University Street, Peoria, Illinois 61604 U.S.A. 



Date of deposit 
18 January 2000 


Accession Number 
B-30256 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) 


This information is continued on an additional sheet [X] 



In respect of those designations in which a European patent or a patent in Norway is sought, a sample of the 
deposited microorganism will be made available until the publication of the mention of the grant of the European patent or 
the corresponding information concerning the patent in Norway or until the date on which the application has been refused 
or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28 (4) EPC and the corresponding regulations in Norway). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., 
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorised officer 



For International Bureau use only 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 






CLAIMS 

What is claimed is: 

1. A purified and isolated seven transmembrane receptor polypeptide 
comprising an amino acid sequence at least 90% identical to an amino acid sequence 
set forth in any one of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18 or 20, or a fragment 
thereof comprising an epitope specific to said seven transmembrane receptor 
polypeptide. 

2. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 2, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

3. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 4, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

4. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 6, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

5. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 8, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

6. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 10, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

7. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 12, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

8. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 14, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

9. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 16, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

10. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 18, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

11. A purified and isolated seven transmembrane receptor polypeptide 
according to claim 1 comprising an amino acid sequence at least 90% identical to the 
amino acid sequence set forth in SEQ ID NO: 20, or a fragment thereof comprising an 
epitope specific to said seven transmembrane receptor polypeptide. 

12. A purified and isolated seven transmembrane receptor polypeptide 
according to any one of claims 1-11. 

13. A purified and isolated polypeptide according to any one of claims 
1-11 comprising at least one extracellular domain of the seven transmembrane 
receptor polypeptide. 

14. A purified and isolated polypeptide according to any one of claims 
1-11 comprising the N-terminal extracellular domain of the seven transmembrane 
receptor polypeptide. 

15. A purified and isolated polypeptide according to any one of claims 
1-11 comprising a seven transmembrane receptor fragment selected from the group 
consisting of an N-terminal extracellular domain, transmembrane domains, 
extracellular loops connecting transmembrane domains, intracellular loops connecting 
transmembrane domains, a C-terminal cytoplasmic domain, and fusions thereof. 

16. A polypeptide according to any one of claims 1-15, wherein the 
polypeptide further includes a heterologous tag amino acid sequence. 

17. A purified and isolated polynucleotide comprising a nucleotide 
sequence that encodes the polypeptide of claim 16. 

18. A purified and isolated polynucleotide comprising a nucleotide 
sequence that encodes a polypeptide according to any one of claims 2, 3, 4, 8 or 9. 

19. A purified and isolated polynucleotide comprising a heterologous 
expression control sequence operatively linked to a nucleotide sequence that encodes 
a polypeptide according to any one of claims 1-16. 

20. The polynucleotide according to claim 19, wherein the expression 
control sequence is a promoter sequence that promotes expression of said 
polynucleotide in an eukaryotic cell. 

21. The polynucleotide according to claim 19, wherein the promoter is 
a heterologous promoter that promotes expression of the polynucleotide in a human 
cell. 

22. A purified and isolated polynucleotide comprising a nucleotide 
sequence that encodes a mammalian seven transmembrane receptor, wherein said 
polynucleotide hybridizes to any one of the nucleotide sequences set forth in SEQ ID 
NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 19 or the non-coding strand complementary 
thereto, under the following hybridization conditions: 

(a) hybridization for 16 hours at 42°C in a hybridization solution 
comprising 50% formamide, 1% SDS, 1 M NaCl, 10% dextran sulfate and 

(b) washing 2 times for 30 minutes at 60°C in a wash solution 
comprising 0.1x SSC and 1% SDS, 

with the proviso that the nucleotide sequence of the polynucleotide differs from the 
coding sequence set forth in any one of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, or 
19 and from its complementary strand by at least one nucleotide. 

23. A polynucleotide according to claim 22 that encodes a human 
seven transmembrane receptor. 

24. A vector comprising a polynucleotide according to any one of 
claims 17-23. 

25. A vector according to claim 24 that is an expression vector for 
expressing the polynucleotide in a mammalian cell. 

26. A host cell stably transformed or transfected with a polynucleotide 
according to any one of claims 17-23 in a manner allowing the expression in said host 
cell of the polypeptide or fragment thereof encoded by the polynucleotide. 

27. A host cell stably transformed or transfected with a vector 
according to claim 24 or 25 in a manner allowing the expression in said host cell of 
the polypeptide or fragment thereof encoded by the polynucleotide. 

28. A method for producing a seven transmembrane receptor 
polypeptide comprising the steps of growing a host cell according to claim 26 or 27 in 
a nutrient medium under conditions in which the host cell expresses a seven 
transmembrane receptor encoded by the polynucleotide. 

29. A method according to claim 28, further comprising a step of 
isolating said polypeptide from said cell or said medium. 

30. A method according to claim 29, further comprising a step of 
isolating cell membranes from the host cell, wherein the cell membrane comprises the 
seven transmembrane receptor. 

31. An antibody specific for a polypeptide according to any one of 
claims 1-15. 

32. The antibody of claim 31 which is a monoclonal antibody. 

33. A hybridoma that produces an antibody according to claim 32. 

34. An antibody according to claim 31 that is a humanized antibody. 

35. An antibody according to claim 31 that specifically binds an 
extracellular epitope of a seven transmembrane receptor having an amino acid 
sequence selected from the group consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 
16, 18 or 20. 

36. An antibody according to claim 35 that specifically binds to the 
amino-terminal extracellular domain of the seven transmembrane receptors. 

37. A cell-free composition comprising polyclonal antibodies, wherein 
at least one of said antibodies is an antibody according to claim 31. 

38. An anti-idiotypic antibody specific for an antibody according to 
claim 31. 

39. A polypeptide comprising a fragment of an antibody according to 
claim 31, wherein said fragment and said polypeptide specifically bind to a seven 
transmembrane receptor having an amino acid sequence selected from the group 
consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18 or 20. 

40. A polypeptide according to claim 39 that is selected from the 
group consisting of single chain antibodies and CDR-grafted antibodies. 

41. A composition comprising a polypeptide according to any one of 
claims 1-16 in a pharmaceutically acceptable carrier. 

42. A composition comprising an antibody according to any one of 
claims 31, 32, 34, 35, or 36, or a polypeptide according to claim 39 or 40, in a 
pharmaceutically acceptable carrier. 

43. A method for modulating ligand binding of a seven 
transmembrane receptor polypeptide according to any one of claims 1-15, comprising 
the step of contacting said seven transmembrane receptor polypeptide with an 
antibody specific for said seven transmembrane receptor, under conditions wherein 
the antibody binds the receptor. 

44. A method for modulating ligand binding of a seven 
transmembrane receptor polypeptide comprising the step of contacting said seven 
transmembrane receptor polypeptide with a polypeptide according to claim 39 or 40. 

45. An assay to identify compounds that bind a seven transmembrane 
receptor polypeptide, said assay comprising the steps of: 

(a) contacting a composition comprising a seven transmembrane 
receptor polypeptide according to any of claims 1-15 with a compound suspected of 
binding the seven transmembrane receptor polypeptide; and 

(b) measuring binding between the compound and the seven 
transmembrane receptor polypeptide. 

46. A method for identifying a modulator of binding between a seven 
transmembrane receptor polypeptide and a binding partner of the seven 
transmembrane receptor polypeptide, comprising the steps of: 

(a) contacting the binding partner and a composition comprising 
the seven transmembrane receptor polypeptide in the presence and in the absence of a 
putative modulator compound, where the seven transmembrane receptor polypeptide 
is a polypeptide according to any one of claims 1-15; 

(b) measuring binding between the binding partner and said seven 
transmembrane receptor polypeptide; and 

(c) identifying a putative modulator compound in view of 
decreased or increased binding between the binding partner and seven transmembrane 
receptor polypeptide in the presence of the putative modulator, as compared to 
binding in the absence of the putative modulator. 

47. An assay according to claim 45 or 46 wherein the composition 
comprises a cell expressing the seven transmembrane receptor polypeptide on its 
surface. 

48. An assay according to claim 47 wherein the measuring step 
comprises measuring intracellular signaling of the seven transmembrane receptor 
polypeptide induced by the compound. 

49. A method for treating a neurological disorder comprising the step 
of administering to a mammal in need of such treatment a pharmaceutical 
composition comprising a compound in an amount effective to modulate biological 
activity of a seven transmembrane receptor in neurons of said mammal, wherein the 
compound is selected from the group consisting of: 

(a) an antibody according to any one of claims 31, 32, 34, 35, or 36; 

(b) an anti-idiotypic antibody according to claim 38; 

(c) a polypeptide according to claim 39 or 40; 

(d) a compound identified according to the method of claim 45; and 

(e) a modulator identified according to claim 46. 

50. The method of claim 49 wherein the neurological disorder is 
schizophrenia. 

51. A method according to claim 50, wherein the seven 
transmembrane receptor comprises a polypeptide according to claim 8. 

52. A method of treating schizophrenia comprising the step of 
administering to a human diagnosed with schizophrenia an amount of a modulator of 
CON202 receptor activity sufficient to modulate CON202 receptor activity or 
CON202 ligand binding in said human. 

53. A method of diagnosing schizophrenia or a susceptibility to 
schizophrenia comprising the steps of: 

(a) measuring the presence or amount of expression or activity of a 
polypeptide according to claim 8 in a cell of a human patient; and 

(b) comparing the measurement of step (a) to a measurement of expression 
or activity of the polypeptide in a cell from a normal subject or the patient at an earlier 
time, wherein the diagnosis of schizophrenia or susceptibility to schizophrenia is 
based on the presence or amount of CON202 polypeptide expression or activity. 

54. A method of screening a human subject to diagnose a disorder 
affecting the brain or genetic predisposition therefor, comprising the steps of: 

(a) assaying nucleic acid of a human subject to determine a presence or an 
absence of a mutation altering the amino acid sequence, expression, or biological 
activity of at least one seven transmembrane receptor that is expressed in the brain, 
wherein the seven transmembrane receptor comprises an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, and 
20, or an allelic variant thereof, and wherein the nucleic acid corresponds to the gene 
encoding the seven transmembrane receptor; and 

(b) diagnosing the disorder or predisposition from the presence or absence of 
said mutation, wherein the presence of a mutation altering the amino acid sequence, 
expression, or biological activity of allele in the nucleic acid correlates with an 
increased risk of developing the disorder. 

55. A method according to claim 54, wherein the seven 
transmembrane receptor is CON202 comprising an amino acid sequence set forth in 
SEQ ID NO: 14, or an allelic variant thereof. 

56. A method according to claim 55, wherein the disease is 
schizophrenia. 

57. A method according to claim 56, wherein the assaying step 
comprises at least one procedure selected from the group consisting of: 

(a) determining a nucleotide sequence of at least one codon of at least one 
CON202 allele of the human subject; 

(b) performing a hybridization assay to determine whether nucleic acid 
from the human subject has a nucleotide sequence identical to or different from one or 
more reference sequences; 

(c) performing a polynucleotide migration assay to determine whether 
nucleic acid from the human subject has a nucleotide sequence identical to or different 
from one or more reference sequences; and 

(d) performing a restriction endonuclease digestion to determine whether 
nucleic acid from the human subject has a nucleotide sequence identical to or different 
from one or more reference sequences. 

58. A method according to claim 56 wherein the assaying step 
comprises: performing a polymerase chain reaction (PCR) to amplify nucleic acid 
comprising CON202 coding sequence, and determining nucleotide sequence of the 
amplified nucleic acid. 

59. A method of screening for a CON202 hereditary schizophrenia 
genotype in a human patient, comprising the steps of: 

(a) providing a biological sample comprising nucleic acid from 
said patient, said nucleic acid including sequences corresponding to said patient's 
CON202 alleles; 

(b) analyzing said nucleic acid for the presence of a mutation or 
mutations; 

(c) determining a CON202 genotype from said analyzing step; and 

(d) correlating the presence of a mutation in a CON202 allele with 
a hereditary schizophrenia genotype. 

60. The method according to claim 59 wherein said biological 
sample is a cell sample. 

61. The method according to claim 59 wherein said analyzing 
comprises sequencing a portion of said nucleic acid, said portion comprising at least 
one codon of said CON202 alleles. 

62. The method according to claim 59 wherein said nucleic acid is 
DNA. 

63. The method according to claim 59 wherein said nucleic acid is 
RNA. 

64. A kit for screening a human subject to diagnose schizophrenia 
or a genetic predisposition therefor, comprising, in association: 

(a) an oligonucleotide useful as a probe for identifying polymorphisms in a 
human CON202 seven transmembrane receptor gene, the oligonucleotide comprising 
6-50 nucleotides that have a sequence that is identical or exactly complementary to a 
portion of a wild type human CON202 gene sequence or CON202 coding sequence, 
except for one sequence difference selected from the group consisting of a nucleotide 
addition, a nucleotide deletion, or nucleotide substitution; and 

(b) a media packaged with the oligonucleotide containing information 
identifying polymorphisms identifiable with the probe that correlate with 
schizophrenia or a genetic predisposition therefor. 

65. A method of identifying a seven transmembrane allelic variant 
that correlates with a mental disorder, comprising steps of: 

(a) providing a biological sample comprising nucleic acid from a 
human patient diagnosed with a mental disorder, or from the patient's genetic 
progenitors or progeny; 

(b) analyzing said nucleic acid for the presence of a mutation or 
mutations in at least one seven transmembrane receptor that is expressed in the brain, 
wherein the at least one seven transmembrane receptor comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 
18, and 20, or an allelic variant thereof, and wherein the nucleic acid includes 
sequence corresponding to the gene or genes encoding the at least one seven 
transmembrane receptor; 

(c) determining a genotype for the patient for the at least one seven 
transmembrane receptor from said analyzing step; and 

(d) identifying an allelic variant that correlates with the mental 
disorder from the determining step. 

66. A method according to claim 65, wherein the disorder is 
schizophrenia, and wherein the at least one seven transmembrane receptor comprises 
CON202 having an amino acid sequence set forth in SEQ ID NO: 14, or an allelic 
variant thereof. 

67. A purified and isolated polynucleotide comprising a nucleotide 
sequence encoding a CON202 receptor allelic variant identified according to claim 66. 

68. A host cell transformed or transfected with a polynucleotide 
according to claim 67 or with a vector comprising the polynucleotide. 
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69. A purified polynucleotide comprising a nucleotide sequence 
encoding a CON202 seven transmembrane receptor protein of a human that is affected 
with schizophrenia; 

wherein said polynucleotide hybridizes lo the complement of SEQ ID 
NO: 13 under the following hybridization conditions: 

(a) hybridization for 16 hours at 42°C in a hybridization solution 
comprising 50% formamide, 1% SDS, 1 M NaCl, 10% dextran sulfate; and 

(b) washing 2 times for 30 minutes at 60°C in a wash solution 
comprising 0.1 x SSC and 1% SDS; and 

wherein the polynucleotide encodes a CON202 amino acid sequence 
that differs from SEQ ID NO: 14 at at least one residue. 
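
By way of illustration only, the requirement that an encoded amino acid sequence differ from a reference sequence at one or more residues can be checked as sketched below. The function name `differing_residues` and the short toy sequences are hypothetical and are not taken from the sequence listing; the sketch assumes the two sequences have already been aligned to equal length.

```python
def differing_residues(reference: str, variant: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, reference residue, variant residue) per mismatch.

    Assumes both sequences are one-letter amino acid strings that have
    already been aligned to the same length (no gaps are handled here).
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [
        (pos, ref_aa, var_aa)
        for pos, (ref_aa, var_aa) in enumerate(zip(reference, variant), start=1)
        if ref_aa != var_aa
    ]


# Toy sequences for illustration only; these are NOT SEQ ID NO: 14.
toy_reference = "MLTLNKTD"
toy_variant = "MLTLNRTD"
print(differing_residues(toy_reference, toy_variant))  # [(6, 'K', 'R')]
```

A non-empty result indicates a difference at one or more residues; a real comparison would first align the sequences with a suitable alignment tool.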

70. A vector comprising a polynucleotide according to claim 69. 

71. A host cell that has been transformed or transfected with a 
polynucleotide according to claim 70 and that expresses the CON202 protein encoded 
by the polynucleotide. 

72. A host cell according to claim 71 that has been co-transfected 
with a polynucleotide encoding the CON202 amino acid sequence set forth in SEQ ID 
NO: 14 and that expresses the CON202 protein having the amino acid sequence set 
forth in SEQ ID NO: 14. 

73. A method for identifying a modulator of CON202 biological 
activity, comprising the steps of: 

a) contacting a cell according to claim 71 in the presence and in 
the absence of a putative modulator compound; 

b) measuring CON202 biological activity in the cell; and 

c) identifying a putative modulator compound in view of 
decreased or increased CON202 biological activity in the presence versus absence of 
the putative modulator. 

74. An assay to identify compounds useful for the treatment of 
schizophrenia, said assay comprising steps of: 

(a) contacting a composition comprising a seven transmembrane 
receptor polypeptide according to claim 8 with a compound suspected of binding the 
seven transmembrane receptor polypeptide; 

(b) measuring binding between the compound and the seven 
transmembrane receptor polypeptide; and 

(c) identifying molecules that bind the seven transmembrane receptor 
as candidate compounds useful for the treatment of schizophrenia. 

75. A method for identifying a compound useful as a modulator of 
binding between a seven transmembrane receptor polypeptide and a binding partner of 
the seven transmembrane receptor polypeptide, which modulator is useful for 
treatment of schizophrenia, comprising the steps of: 

(a) contacting the binding partner and a composition comprising 
the seven transmembrane receptor polypeptide in the presence and in the absence of a 
putative modulator compound, where the seven transmembrane receptor polypeptide 
is a polypeptide according to claim 8; 

(b) measuring binding between the binding partner and the seven 
transmembrane receptor polypeptide; 

(c) identifying a modulator compound useful for the treatment of 
schizophrenia in view of decreased or increased binding between the binding partner 
and seven transmembrane receptor polypeptide in the presence of the putative 
modulator, as compared to binding in the absence of the putative modulator. 
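
The comparison recited in step (c) can be sketched as follows, purely as an illustration. The function name `classify_modulator` and the fold-change cutoff of 1.5 are assumptions made for this example; the claims do not specify any threshold, and a real assay would rely on replicate measurements and appropriate statistics.

```python
def classify_modulator(binding_with: float, binding_without: float,
                       min_fold_change: float = 1.5) -> str:
    """Classify a putative modulator from two binding measurements.

    binding_with    -- binding measured in the presence of the compound
    binding_without -- binding measured in the absence of the compound
    min_fold_change -- hypothetical cutoff; not specified in the claims
    """
    if binding_without <= 0 or binding_with < 0:
        raise ValueError("binding values must be positive (baseline) and non-negative")
    ratio = binding_with / binding_without
    if ratio >= min_fold_change:
        return "enhancer"      # increased binding in the presence of the compound
    if ratio <= 1 / min_fold_change:
        return "inhibitor"     # decreased binding in the presence of the compound
    return "no effect"


print(classify_modulator(30.0, 100.0))   # inhibitor
print(classify_modulator(110.0, 100.0))  # no effect
```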

76. An assay according to claim 74 or 75 wherein the composition 
comprises a cell expressing the seven transmembrane receptor polypeptide on its 
surface. 

77. An assay according to claim 76 wherein the composition 
comprises a cell transformed or transfected with a polynucleotide encoding the seven 
transmembrane polypeptide and expressing the seven transmembrane receptor 
polypeptide on its surface. 

SEQUENCE LISTING 



<110> Pharmacia & Upjohn Company 

<120> G PROTEIN-COUPLED RECEPTORS EXPRESSED IN BRAIN 

<130> 28341/6276P 

<140> 
<141> 

<150> US 09/481,794 

<151> 2000-01-12 

<150> US 09/454,399 
<151> 1999-12-03 

<150> US 09/429,517 
<151> 1999-10-28 

<150> US 09/429,555 
<151> 1999-10-28 

<150> US 09/429,676 
<151> 1999-10-28 

<150> US 09/429,695 
<151> 1999-10-28 

<150> US 09/428,114 
<151> 1999-10-27 

<150> US 09/428,020 
<151> 1999-10-27 

<150> US 09/427,859 
<151> 1999-10-27 

<150> US 09/427,653 
<151> 1999-10-27 

<160> 75 

<170> PatentIn Ver. 2.0 

<210> 1 

<211> 1308 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (157)..(1122) 
<220> 

<221> misc_feature 
<222> (1) 

<223> N = A or C or G or T 



<220> 

<221> misc_feature 
<222> (1237) 

<223> N = A or C or G or T 
<220> 

<221> misc_feature 
<222> (1274) 

<223> N = A or C or G or T 
<400> 1 

ntggttgttg gaccattaaa atgcattatg gaatttttaa aagttggggg agagggagac 60 

agtaaaaata acctatattt tctcttgttt tttttttttt aactctagga aagcccagac 120 

aaattttgag ctatttcata acctaccaga cttatc atg cta aca ctg aat aaa 174 
Met Leu Thr Leu Asn Lys 
1 5 

aca gac cta ata cca gct tca ttt att ctg aat gga gtc cca gga ctg 222 
Thr Asp Leu Ile Pro Ala Ser Phe Ile Leu Asn Gly Val Pro Gly Leu 
10 15 20 

gaa gac aca caa ctc tgg att tcc ttc cca ttc tgc tct atg tat gtt 270 
Glu Asp Thr Gln Leu Trp Ile Ser Phe Pro Phe Cys Ser Met Tyr Val 
25 30 35 

gtg gct atg gta ggg aat tgt gga ctc ctc tac ctc att cac tat gag 318 
Val Ala Met Val Gly Asn Cys Gly Leu Leu Tyr Leu Ile His Tyr Glu 
40 45 50 

gat gcc ctg cac aaa ccc atg tac tac ttc ttg gcc atg ctt tcc ttt 366 
Asp Ala Leu His Lys Pro Met Tyr Tyr Phe Leu Ala Met Leu Ser Phe 
55 60 65 70 

act gac ctt gtt atg tgc tct agt aca atc cct aaa gcc ctc tgc atc 414 
Thr Asp Leu Val Met Cys Ser Ser Thr Ile Pro Lys Ala Leu Cys Ile 
75 80 85 

ttc tgg ttt cat ctc aag gac att gga ttt gat gaa tgc ctt gtc cag 462 
Phe Trp Phe His Leu Lys Asp Ile Gly Phe Asp Glu Cys Leu Val Gln 
90 95 100 

atg ttc ttc atc cac acc ttc aca ggg atg gag tct ggg gtg ctt atg 510 
Met Phe Phe Ile His Thr Phe Thr Gly Met Glu Ser Gly Val Leu Met 
105 110 115 

ctt atg gcc ctg gat cgc tat gtg gcc atc tgc tac ccc tta cgc tat 558 
Leu Met Ala Leu Asp Arg Tyr Val Ala Ile Cys Tyr Pro Leu Arg Tyr 
120 125 130 

tca act atc ctc acc aat cct gta att gca aag gtt ggg act gcc acc 606 
Ser Thr Ile Leu Thr Asn Pro Val Ile Ala Lys Val Gly Thr Ala Thr 
135 140 145 150 

ttc ctg aga ggg gta tta ctc att att ccc ttt act ttc ctc acc aag 654 
Phe Leu Arg Gly Val Leu Leu Ile Ile Pro Phe Thr Phe Leu Thr Lys 
155 160 165 

cgc ctg ccc tcc tgc aga ggc aat ata ctt ccc cat acc tac tgt gac 702 
Arg Leu Pro Ser Cys Arg Gly Asn Ile Leu Pro His Thr Tyr Cys Asp 
170 175 180 

cac atg tct gta gcc aaa ttg tcc tgt ggt aat gtc aag gtc aat gcc 750 
His Met Ser Val Ala Lys Leu Ser Cys Gly Asn Val Lys Val Asn Ala 
185 190 195 

atc tat ggt ctg atg gtt gcc ctc ctg att ggg ggc ttt gac ata ctg 798 
Ile Tyr Gly Leu Met Val Ala Leu Leu Ile Gly Gly Phe Asp Ile Leu 
200 205 210 

tgt atc acc atc tcc tat acc atg att ctc cgg gca gtg gtc agc ctc 846 
Cys Ile Thr Ile Ser Tyr Thr Met Ile Leu Arg Ala Val Val Ser Leu 
215 220 225 230 

tcc tca gca gat gct cgg cag aag gcc ttt aat acc tgc act gcc cac 894 
Ser Ser Ala Asp Ala Arg Gln Lys Ala Phe Asn Thr Cys Thr Ala His 
235 240 245 

att tgt gcc att gtt ttc tcc tat act cca gct ttc ttc tcc ttc ttt 942 
Ile Cys Ala Ile Val Phe Ser Tyr Thr Pro Ala Phe Phe Ser Phe Phe 
250 255 260 

tcc cac cgc ttt ggg gaa cac ata atc ccc cct tct tgc cac atc att 990 
Ser His Arg Phe Gly Glu His Ile Ile Pro Pro Ser Cys His Ile Ile 
265 270 275 

gta gcc aat att tat ctg ctc cta cca ccc act atg aac cct att gtc 1038 
Val Ala Asn Ile Tyr Leu Leu Leu Pro Pro Thr Met Asn Pro Ile Val 
280 285 290 

tat ggg gtg aaa acc aaa cag ata cga gac tgt gtc ata agg atc ctt 1086 
Tyr Gly Val Lys Thr Lys Gln Ile Arg Asp Cys Val Ile Arg Ile Leu 
295 300 305 310 

tca ggt tct aag gat acc aaa tcc tac agc atg tga atgaacactt 1132 
Ser Gly Ser Lys Asp Thr Lys Ser Tyr Ser Met 
315 320 

gccaggagtg agaagagaag gaaagaatta cttctatttg cctcttatgc aggagttcat 1192 

aaaatctttc tggaagtact gtattgatca caaaatggag tttgntgact ggtgcattct 1252 

caataagtac cttgggaatc tnacatcact ggaaggccca ccacatttct ataaat 1308 



<210> 2 
<211> 321 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Leu Thr Leu Asn Lys Thr Asp Leu Ile Pro Ala Ser Phe Ile Leu 
1 5 10 15 

Asn Gly Val Pro Gly Leu Glu Asp Thr Gln Leu Trp Ile Ser Phe Pro 
20 25 30 

Phe Cys Ser Met Tyr Val Val Ala Met Val Gly Asn Cys Gly Leu Leu 
35 40 45 

Tyr Leu Ile His Tyr Glu Asp Ala Leu His Lys Pro Met Tyr Tyr Phe 
50 55 60 

Leu Ala Met Leu Ser Phe Thr Asp Leu Val Met Cys Ser Ser Thr Ile 
65 70 75 80 

Pro Lys Ala Leu Cys Ile Phe Trp Phe His Leu Lys Asp Ile Gly Phe 
85 90 95 

Asp Glu Cys Leu Val Gln Met Phe Phe Ile His Thr Phe Thr Gly Met 
100 105 110 

Glu Ser Gly Val Leu Met Leu Met Ala Leu Asp Arg Tyr Val Ala Ile 
115 120 125 

Cys Tyr Pro Leu Arg Tyr Ser Thr Ile Leu Thr Asn Pro Val Ile Ala 
130 135 140 

Lys Val Gly Thr Ala Thr Phe Leu Arg Gly Val Leu Leu Ile Ile Pro 
145 150 155 160 

Phe Thr Phe Leu Thr Lys Arg Leu Pro Ser Cys Arg Gly Asn Ile Leu 
165 170 175 

Pro His Thr Tyr Cys Asp His Met Ser Val Ala Lys Leu Ser Cys Gly 
180 185 190 

Asn Val Lys Val Asn Ala Ile Tyr Gly Leu Met Val Ala Leu Leu Ile 
195 200 205 

Gly Gly Phe Asp Ile Leu Cys Ile Thr Ile Ser Tyr Thr Met Ile Leu 
210 215 220 

Arg Ala Val Val Ser Leu Ser Ser Ala Asp Ala Arg Gln Lys Ala Phe 
225 230 235 240 

Asn Thr Cys Thr Ala His Ile Cys Ala Ile Val Phe Ser Tyr Thr Pro 
245 250 255 

Ala Phe Phe Ser Phe Phe Ser His Arg Phe Gly Glu His Ile Ile Pro 
260 265 270 

Pro Ser Cys His Ile Ile Val Ala Asn Ile Tyr Leu Leu Leu Pro Pro 
275 280 285 

Thr Met Asn Pro Ile Val Tyr Gly Val Lys Thr Lys Gln Ile Arg Asp 
290 295 300 

Cys Val Ile Arg Ile Leu Ser Gly Ser Lys Asp Thr Lys Ser Tyr Ser 
305 310 315 320 

Met 
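
As a consistency check on the listing, the CDS coordinates given under <222> can be compared with the protein length given under <211>. The following is a minimal sketch; the helper name `cds_codon_count` is hypothetical. It assumes, as appears to be the case for SEQ ID NO: 1 and SEQ ID NO: 3, that the annotated CDS range includes the terminal stop codon.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Number of codons spanned by an inclusive, 1-based CDS range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


# SEQ ID NO: 1 lists CDS (157)..(1122); SEQ ID NO: 2 lists 321 residues.
# One codon is subtracted on the assumption that the range includes the stop codon.
print(cds_codon_count(157, 1122) - 1)  # 321

# SEQ ID NO: 3 lists CDS (1)..(1014); SEQ ID NO: 4 lists 337 residues.
print(cds_codon_count(1, 1014) - 1)    # 337
```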



<210> 3 

<211> 1014 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (1)..(1014) 



<400> 3 

atg gat gaa aca gga aat ctg aca gta tct tct gcc aca tgc cat gac 48 

Met Asp Glu Thr Gly Asn Leu Thr Val Ser Ser Ala Thr Cys His Asp 

15 10 15 

act att gat gac ttc cgc aat caa gtg tat tcc acc ttg tac tct atg 96 

Thr Ile Asp Asp Phe Arg Asn Gln Val Tyr Ser Thr Leu Tyr Ser Met 

20 25 30 

ate tct gtt gta ggc ttc ttt ggc aat ggc ttt gtg etc tat gtc etc 144 

He Ser Val Val Gly Phe Phe Gly Asn Gly Phe Val Leu Tyr Val Leu 

35 40 45 

ata aaa acc tat cac aag aag tea gcc ttc caa gta tac atg att aat 192 

He Lys Thr Tyr His Lys Lys Ser Ala Phe Gin Val Tyr Met He Asn 

50 55 60 

tta gca gta gca gat eta ctt tgt gtg tgc aca ctg cet etc cgt gtg 240 

Leu Ala Val Ala Asp Leu Leu Cys Val Cys Thr Leu Pro Leu Arg Val 

65 70 75 80 



gtc tat tat gtt cac aaa ggc att tgg ctc ttt ggt gac ttc ttg tgc 288 
Val Tyr Tyr Val His Lys Gly Ile Trp Leu Phe Gly Asp Phe Leu Cys 
85 90 95 



cgc etc age ace tat get ttg tat gtc aac etc tat tgt age ate ttc 336 
Arg Leu Ser Thr Tyr Ala Leu Tyr Val Asn Leu Tyr Cys Ser He Phe 
100 105 110 

ttt atg aca gcc atg age ttt ttc egg tgc att gca att gtt ttt cca 384 
Phe Met Thr Ala Met Ser Phe Phe Arg Cys He Ala He Val Phe Pro 
115 120 125 

gtc eag aac att aat ttg gtt aca cag aaa aaa gcc agg ttt gtg tgt 432 
Val Gin Asn He Asn Leu Val Thr Gin Lys Lys Ala Arg Phe Val Cys 
130 135 140 

gta ggt att tgg att ttt gtg att ttg acc agt tct cca ttt eta atg 480 
Val Gly He Trp He Phe Val He Leu Thr Ser Ser Pro Phe Leu Met 
145 150 155 160 

gcc aaa cca caa aaa gat gag aaa aat aat acc aag tgc ttt gag ccc 528 
Ala Lys Pro Gin Lys Asp Glu Lys Asn Asn Thr Lys Cys Phe Glu Pro 
165 170 175 

cca caa gac aat caa act aaa aat cat gtt ttg gtc ttg cat tat gtg 576 
Pro Gin Asp Asn Gin Thr Lys Asn His Val Leu Val Leu His Tyr Val 
180 185 190 

tea ttg ttt gtt ggc ttt ate ate cct ttt gtt att ata att gtc tgt 624 
Ser Leu Phe Val Gly Phe He He Pro Phe Val He He He Val Cys 
195 200 205 

tac aca atg ate att ttg acc tta eta aaa aaa tea atg aaa aaa aat 672 
Tyr Thr Met He He Leu Thr Leu Leu Lys Lys Ser Met Lys Lys Asn 
210 215 220 

ctg tea agt cat aaa aag get ata gga atg ate atg gtc gtg ace get 720 
Leu Ser Ser His Lys Lys Ala He Gly Met He Met Val Val Thr Ala 
225 230 235 240 

gcc ttt tta gtc agt ttc atg cca tat cat att caa cgt ace att cac 768 
Ala Phe Leu Val Ser Phe Met Pro Tyr His Ile Gln Arg Thr Ile His 
245 250 255 

ctt cat ttt tta cac aat gaa act aaa ccc tgt gat tct gtc ctt aga 816 
Leu His Phe Leu His Asn Glu Thr Lys Pro Cys Asp Ser Val Leu Arg 
260 265 270 




atg cag aag tec gtg gtc ata acc ttg tct ctg get gca tec aat tgt 864 

Met Gin Lys Ser Val Val lie Thr Leu Ser Leu Ala Ala Ser Asn Cys 
275 280 285 

tgc ttt gac cct etc eta tat ttc ttt tct ggg ggt aac ttt agg aaa 912 

Cys Phe Asp Pro Leu Leu Tyr Phe Phe Ser Gly Gly Asn Phe Arg Lys 

290 295 300 

agg ctg tct aca ttt aga aag cat tct ttg tec age gtg act tat gta 960 

Arg Leu Ser Thr Phe Arg Lys His Ser Leu Ser Ser Val Thr Tyr Val 

305 310 315 320 

cec aga aag aag gcc tct ttg cca gaa aaa gga gaa gaa ata tgt aaa 1008 

Pro Arg Lys Lys Ala Ser Leu Pro Glu Lys Gly Glu Glu lie Cys Lys 
325 330 335 

gta tag 1014 
Val 
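
The correspondence between a coding sequence and the amino acid line printed beneath it can be checked by translating the codons with the standard genetic code. The sketch below is offered only as an illustration; the names `CODON_TABLE` and `translate` are introduced here, and the example input is the first line of SEQ ID NO: 3 as printed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order for each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence to one-letter amino acids.

    Whitespace is ignored; translation stops at the first stop codon.
    """
    seq = "".join(dna.lower().split())
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 16 codons of SEQ ID NO: 3 (nucleotides 1 to 48).
first_line = "atg gat gaa aca gga aat ctg aca gta tct tct gcc aca tgc cat gac"
print(translate(first_line))  # MDETGNLTVSSATCHD
```

The result corresponds to Met Asp Glu Thr Gly Asn Leu Thr Val Ser Ser Ala Thr Cys His Asp, matching residues 1 to 16 of SEQ ID NO: 4.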



<210> 4 

<211> 337 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Asp Glu Thr Gly Asn Leu Thr Val Ser Ser Ala Thr Cys His Asp 
1 5 10 15 

Thr Ile Asp Asp Phe Arg Asn Gln Val Tyr Ser Thr Leu Tyr Ser Met 
20 25 30 

Ile Ser Val Val Gly Phe Phe Gly Asn Gly Phe Val Leu Tyr Val Leu 
35 40 45 

Ile Lys Thr Tyr His Lys Lys Ser Ala Phe Gln Val Tyr Met Ile Asn 
50 55 60 

Leu Ala Val Ala Asp Leu Leu Cys Val Cys Thr Leu Pro Leu Arg Val 
65 70 75 80 

Val Tyr Tyr Val His Lys Gly Ile Trp Leu Phe Gly Asp Phe Leu Cys 
85 90 95 

Arg Leu Ser Thr Tyr Ala Leu Tyr Val Asn Leu Tyr Cys Ser Ile Phe 
100 105 110 

Phe Met Thr Ala Met Ser Phe Phe Arg Cys Ile Ala Ile Val Phe Pro 
115 120 125 

Val Gln Asn Ile Asn Leu Val Thr Gln Lys Lys Ala Arg Phe Val Cys 
130 135 140 

Val Gly Ile Trp Ile Phe Val Ile Leu Thr Ser Ser Pro Phe Leu Met 
145 150 155 160 

Ala Lys Pro Gln Lys Asp Glu Lys Asn Asn Thr Lys Cys Phe Glu Pro 
165 170 175 

Pro Gln Asp Asn Gln Thr Lys Asn His Val Leu Val Leu His Tyr Val 
180 185 190 

Ser Leu Phe Val Gly Phe Ile Ile Pro Phe Val Ile Ile Ile Val Cys 
195 200 205 

Tyr Thr Met Ile Ile Leu Thr Leu Leu Lys Lys Ser Met Lys Lys Asn 
210 215 220 

Leu Ser Ser His Lys Lys Ala Ile Gly Met Ile Met Val Val Thr Ala 
225 230 235 240 

Ala Phe Leu Val Ser Phe Met Pro Tyr His Ile Gln Arg Thr Ile His 
245 250 255 

Leu His Phe Leu His Asn Glu Thr Lys Pro Cys Asp Ser Val Leu Arg 
260 265 270 

Met Gln Lys Ser Val Val Ile Thr Leu Ser Leu Ala Ala Ser Asn Cys 
275 280 285 

Cys Phe Asp Pro Leu Leu Tyr Phe Phe Ser Gly Gly Asn Phe Arg Lys 
290 295 300 

Arg Leu Ser Thr Phe Arg Lys His Ser Leu Ser Ser Val Thr Tyr Val 
305 310 315 320 

Pro Arg Lys Lys Ala Ser Leu Pro Glu Lys Gly Glu Glu Ile Cys Lys 
325 330 335 

Val 



<210> 5 

<211> 2429 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (691)..(1845) 
<400> 5 

ggggcctact tcaccgtgta cccggacttg ggaccatcac agacttcaga accatcagga 60 

acctgggagc aactgaaagc tgaactacag tgggctttca gacacacagc aggctgcgga 120 

gcacaaatag gactggttcc ctccaggcca ccagcagggc ggtggaggtc ttcactgact 180 

ccctgcctac ctctcaggac aatgtccttt tggctccaca gtccctgaag ccagagctgg 240 

tgggggcagg gaggcagcca ccagcctcta tatgtagtgg aggagggggt gtccagggag 300 

ggctgcatga tcctgagagc ccccacctca cccggctgga ctatcctccc acttcagggt 360 

ttctctgggc ttccatcttg cccctgctga gccctgcttc ctcctctacc agcagcacaa 420 

cccccaggct gggctcagag acctcatgtg gtgggatcac tcagtacccc gaggcggagg 480 

gaaggaggga gggctgcagg gttccccttg gcctgcaaac aggaacacag ggtgtttctc 540 

^gtggctgcg agaatgctga tgaaaacccc aggatgttgt gtcaccgtgg tggccagctg 600 

atagtgccaa tcatcccact ttgccctgag cactcctgca ggggtagaag actccagaac 660 

cttctctcag gcccatggcc caagcagccc acg gaa ctt cat aac ctg agc tct 714 

Met Glu Leu His Asn Leu Ser Ser 
1 5 

cca tct ccc tct ctc tcc tcc tct gtt ctc cct ccc tcc ttc tct ccc 762 
Pro Ser Pro Ser Leu Ser Ser Ser Val Leu Pro Pro Ser Phe Ser Pro 
10 15 20 

tea cce tec tct get ccc tct gee ttt ace act gtg ggg ggg tec tct 810 
Ser Pro Ser Ser Ala Pro Ser Ala Phe Thr Thr Val Gly Gly Ser Ser 
25 30 35 40 

gga ggg ccc tgc cac ccc acc tct tee teg ctg gtg tet gcc ttc ctg 858 
Gly Gly Pro Cys His Pro Thr Ser Ser Ser Leu Val Ser Ala Phe Leu 
45 50 55 

gea cca ate ctg gcc ctg gag ttt gtc ctg gge ctg gtg ggg aac agt 906 
Ala Pro lie Leu Ala Leu Glu Phe Val Leu Gly Leu Val Gly Asn Ser 
60 65 70 

ttg gcc ctc ttc atc ttc tgc atc cac acg cgg ccc tgg acc tcc aac 954 
Leu Ala Leu Phe Ile Phe Cys Ile His Thr Arg Pro Trp Thr Ser Asn 
75 80 85 

acg gtg ttc ctg gtc age ctg gtg gee get gae ttc etc ctg ate age 1002 
Thr Val Phe Leu Val Ser Leu Val Ala Ala Asp Phe Leu Leu lie Ser 
90 95 100 

aac ctg ccc etc cgc gtg gae tac tac etc etc eat gag aec tgg cgc 1050 
Asn Leu Pro Leu Arg Val Asp Tyr Tyr Leu Leu His Glu Thr Trp Arg 
105 110 115 120 

ttt ggg get get gcc tgc aaa gtc aac etc ttc atg ctg tec ace aac 1098 
Phe Gly Ala Ala Ala Cys Lys Val Asn Leu Phe Met Leu Ser Thr Asn 
125 130 135 

cgc acg gcc age gtt gtc ttc etc aea gcc ate gea etc aac cgc tac 1146 
Arg Thr Ala Ser Val Val Phe Leu Thr Ala lie Ala Leu Asn Arg Tyr 
140 145 150 

ctg aag gtg gtg cag ccc cac cac gtg ctg age egt get tee gtg ggg 1194 
Leu Lys Val Val Gin Pro His His Val Leu Ser Arg Ala Ser Val Gly 
155 160 165 

gea get gcc egg gtg gee ggg gga etc tgg gtg gge ate ctg etc etc 1242 
Ala Ala Ala Arg Val Ala Gly Gly Leu Trp Val Gly lie Leu Leu Leu 
170 175 180 

aac ggg cac ctg etc ctg age ace ttc tec gge ccc tec tgc etc age 1290 
Asn Gly His Leu Leu Leu Ser Thr Phe Ser Gly Pro Ser Cys Leu Ser 
185 190 195 200 

tac agg gtg gge aeg aag cee teg gcc teg etc cgc tgg cac cag gea 1338 
Tyr Arg Val Gly Thr Lys Pro Ser Ala Ser Leu Arg Trp His Gin Ala 
205 210 215 

ctg tac ctg ctg gag ttc ttc ctg cea ctg gcg etc ate etc ttt get 1386 
Leu Tyr Leu Leu Glu Phe Phe Leu Pro Leu Ala Leu lie Leu Phe Ala 
220 225 230 

att gtg agc att ggg ctc acc atc cgg aac cgt ggt ctg ggc ggg cag 1434 
Ile Val Ser Ile Gly Leu Thr Ile Arg Asn Arg Gly Leu Gly Gly Gln 
235 240 245 

gca ggc ccg cag agg gee atg cgt gtg etg gcc atg gtg gtg gee gtc 1482 
Ala Gly Pro Gin Arg Ala Met Arg Val Leu Ala Met Val Val Ala Val 
250 255 260 

tac acc ate tgc ttc ttg ccc age ate ate ttt ggc atg get tec atg 1530 
Tyr Thr lie Cys Phe Leu Pro Ser lie lie Phe Gly Met Ala Ser Met 
265 270 275 280 

gtg get ttc tgg etg tec gcc tge cga tec etg gac etc tgc aea cag 1578 
Val Ala Phe Trp Leu Ser Ala Cys Arg Ser Leu Asp Leu Cys Thr Gin 
285 290 295 

etc ttc cat ggc tec etg gcc ttc ace tac etc aac agt gtc etg gac 1626 
Leu Phe His Gly Ser Leu Ala Phe Thr Tyr Leu Asn Ser Val Leu Asp 
300 305 310 

ccc gtg etc tac tgc ttc tct age ccc aac ttc etc cae cag age egg 1674 
Pro Val Leu Tyr Cys Phe Ser Ser Pro Asn Phe Leu His Gin Ser Arg 
315 320 325 

gee ttg etg ggc etc acg egg ggc egg cag gge cca gtg age gac gag 1722 
Ala Leu Leu Gly Leu Thr Arg Gly Arg Gin Gly Pro Val Ser Asp Glu 
330 335 340 

age tec tac caa ccc tec agg cag tgg cgc tac egg gag gcc tet agg 1770 
Ser Ser Tyr Gin Pro Ser Arg Gin Trp Arg Tyr Arg Glu Ala Ser Arg 
345 350 355 360 

aag gcg gag gcc ata ggg aag etg aaa gtg cag gge gag gtc tct etg 1818 
Lys Ala Glu Ala He Gly Lys Leu Lys Val Gin Gly Glu Val Ser Leu 
365 370 375 

gaa aag gaa ggc tec tec cag ggc tga gggccagctg cagggctgca 1865 
Glu Lys Glu Gly Ser Ser Gin Gly 

380 385 



gcgctgtggg ggtaagggct gccgcgctct ggcctggagg gacaaggcca gcacacggtg 1925 

cctcaaccaa ctggacaagg gatggcggca gaccaggggc caggccaaag cactggcagg 1985 

actcatgtgg gtggcaggga gagaaaccca cctaggcctc tcagtgtgtc caggatggca 2045 

ttcccagaat gcaggggaga gcaggatgcc gggtggagga gacaggcaag gtgccgttgg 2105 

cacaccagct cagacagggg cctgcgcagc tgcaggggac agacgccaat cactgtcaca 2165 

gcagagtcac cttagaaatt ggacagctgc atgttctgtg ctctccagtt tgtcccttcc 2225 

aatattaata aacttccctt ttaaatatat ttatttgcag accaatatct gtctttaatt 2285 

ctaacctggg actgtcagta ggcgtcaaag tgagcgcccc agtgaaggaa ccttggagag 2345 

agtgggagca ttcccagcct tccaggggga ctcgtcttcc agactttgga gcccgcatgt 2405 

ctgaagcaga ctctttcttg gtag 2429 

<210> 6 

<211> 384 

<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Glu Leu His Asn Leu Ser Ser Pro Ser Pro Ser Leu Ser Ser Ser 
1 5 10 15 

Val Leu Pro Pro Ser Phe Ser Pro Ser Pro Ser Ser Ala Pro Ser Ala 
20 25 30 

Phe Thr Thr Val Gly Gly Ser Ser Gly Gly Pro Cys His Pro Thr Ser 
35 40 45 

Ser Ser Leu Val Ser Ala Phe Leu Ala Pro Ile Leu Ala Leu Glu Phe 
50 55 60 

Val Leu Gly Leu Val Gly Asn Ser Leu Ala Leu Phe Ile Phe Cys Ile 
65 70 75 80 

His Thr Arg Pro Trp Thr Ser Asn Thr Val Phe Leu Val Ser Leu Val 
85 90 95 

Ala Ala Asp Phe Leu Leu Ile Ser Asn Leu Pro Leu Arg Val Asp Tyr 
100 105 110 

Tyr Leu Leu His Glu Thr Trp Arg Phe Gly Ala Ala Ala Cys Lys Val 
115 120 125 

Asn Leu Phe Met Leu Ser Thr Asn Arg Thr Ala Ser Val Val Phe Leu 
130 135 140 

Thr Ala Ile Ala Leu Asn Arg Tyr Leu Lys Val Val Gln Pro His His 
145 150 155 160 

Val Leu Ser Arg Ala Ser Val Gly Ala Ala Ala Arg Val Ala Gly Gly 
165 170 175 

Leu Trp Val Gly Ile Leu Leu Leu Asn Gly His Leu Leu Leu Ser Thr 
180 185 190 

Phe Ser Gly Pro Ser Cys Leu Ser Tyr Arg Val Gly Thr Lys Pro Ser 
195 200 205 

Ala Ser Leu Arg Trp His Gln Ala Leu Tyr Leu Leu Glu Phe Phe Leu 
210 215 220 

Pro Leu Ala Leu Ile Leu Phe Ala Ile Val Ser Ile Gly Leu Thr Ile 
225 230 235 240 

Arg Asn Arg Gly Leu Gly Gly Gln Ala Gly Pro Gln Arg Ala Met Arg 
245 250 255 

Val Leu Ala Met Val Val Ala Val Tyr Thr Ile Cys Phe Leu Pro Ser 
260 265 270 

Ile Ile Phe Gly Met Ala Ser Met Val Ala Phe Trp Leu Ser Ala Cys 
275 280 285 

Arg Ser Leu Asp Leu Cys Thr Gln Leu Phe His Gly Ser Leu Ala Phe 
290 295 300 

Thr Tyr Leu Asn Ser Val Leu Asp Pro Val Leu Tyr Cys Phe Ser Ser 
305 310 315 320 

Pro Asn Phe Leu His Gln Ser Arg Ala Leu Leu Gly Leu Thr Arg Gly 
325 330 335 

Arg Gln Gly Pro Val Ser Asp Glu Ser Ser Tyr Gln Pro Ser Arg Gln 
340 345 350 

Trp Arg Tyr Arg Glu Ala Ser Arg Lys Ala Glu Ala Ile Gly Lys Leu 
355 360 365 

Lys Val Gln Gly Glu Val Ser Leu Glu Lys Glu Gly Ser Ser Gln Gly 
370 375 380 

<210> 7 

<211> 1484 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (146)..(1147) 

<400> 7 

ttgaatttag gtgacactat agaagagcta tgacgtcgca tgcacgcgta cgtaagctcg 60 

gaattcggct cgagctgaac taatgactgc cgccataaga agacagagag aactgagtat 120 

cctcccaaag gtgacactgg aagca atg aac acc aca gtg atg caa ggc ttc 172 

Met Asn Thr Thr Val Met Gin Gly Phe 
1 5 

aac aga tct gag cgg tgc ccc aga gac act cgg ata gta cag ctg gta 220 
Asn Arg Ser Glu Arg Cys Pro Arg Asp Thr Arg Ile Val Gln Leu Val 
10 15 20 25 

ttc cca gcc etc tac aca gtg gtt ttc ttg acc ggc ate ctg ctg aat 268 
Phe Pro Ala Leu Tyr Thr Val Val Phe Leu Thr Gly lie Leu Leu Asn 
30 35 40 

act ttg get ctg tgg gtg ttt gtt cac ate ecc age tec tec acc ttc 316 
Thr Leu Ala Leu Trp Val Phe Val His lie Pro Ser Ser Ser Thr Phe 
45 50 55 

ate ate tae etc aaa aac act ttg gtg gee gae ttg ata atg aca etc 364 
lie lie Tyr Leu Lys Asn Thr Leu Val Ala Asp Leu lie Met Thr Leu 
60 65 70 

atg ett cct ttc aaa ate etc tct gac tea cac ctg gca ccc tgg cag 412 
Met Leu Pro Phe Lys He Leu Ser Asp Ser His Leu Ala Pro Trp Gin 
75 80 85 



ctc aga gct ttt gtg tgt cgt ttt tct tcg gtg ata ttt tat gag acc 460 
Leu Arg Ala Phe Val Cys Arg Phe Ser Ser Val Ile Phe Tyr Glu Thr 
90 95 100 105 



atg tat gtg ggc ate gtg etg tta ggg etc ata gee ttt gac aga ttc 508 
Met Tyr Val Gly He Val Leu Leu Gly Leu He Ala Phe Asp Arg Phe 
110 115 120 

ctc aag atc atc aga cct ttg aga aat att ttt cta aaa aaa cct gtt 556 
Leu Lys Ile Ile Arg Pro Leu Arg Asn Ile Phe Leu Lys Lys Pro Val 

125 130 135 

ttt gca aaa acg gtc tea ate ttc ate tgg gtc ttt ttg gtc ttc ate 604 
Phe Ala Lys Thr Val Ser He Phe He Trp Val Phe Leu Val Phe He 
140 145 150 

tec etg cea aat atg ate ttg age aae aag gaa gca aca eea teg tct 652 
Ser Leu Pro Asn Met He Leu Ser Asn Lys Glu Ala Thr Pro Ser Ser 
155 160 165 

gtg aaa aag tgt get tec tta aag ggg eet ctg ggg etg aaa tgg cat 700 
Val Lys Lys Cys Ala Ser Leu Lys Gly Pro Leu Gly Leu Lys Trp His 
170 175 180 185 

caa atg gta aat aac ata tgc eag ttt att ttc tgg act ggt ttt ate 748 
Gin Met Val Asn Asn He Cys Gin Phe He Phe Trp Thr Gly Phe He 
190 195 200 

eta atg ctt gtg ttt tat gtg gtt att gea aaa aaa gta tat gat tct 796 
Leu Met Leu Val Phe Tyr Val Val He Ala Lys Lys Val Tyr Asp Ser 
205 210 215 

tat aga aag tee aaa agt aag gac aga aaa aae aac aaa aag etg gaa 844 
Tyr Arg Lys Ser Lys Ser Lys Asp Arg Lys Asn Asn Lys Lys Leu Glu 
220 225 230 

ggc aaa gta ttt gtt gtc gtg get gtc ttc ttt gtg tgt ttt get cca 892 
Gly Lys Val Phe Val Val Val Ala Val Phe Phe Val Cys Phe Ala Pro 
235 240 245 

ttt cat ttt gcc aga gtt cca tat act cac agt caa ace aac aat aag 940 
Phe His Phe Ala Arg Val Pro Tyr Thr His Ser Gin Thr Asn Asn Lys 
250 255 260 265 

act gac tgt aga ctg caa aat caa ctg ttt att get aaa gaa aca act 988 
Thr Asp Cys Arg Leu Gin Asn Gin Leu Phe He Ala Lys Glu Thr Thr 
270 275 280 

etc ttt ttg gca gca act aac att tgt atg gat ccc tta ata tac ata 1036 
Leu Phe Leu Ala Ala Thr Asn He Cys Met Asp Pro Leu He Tyr He 
285 290 295 

ttc tta tgt aaa aaa ttc aca gaa aag eta cca tgt atg caa ggg aga 1084 
Phe Leu Cys Lys Lys Phe Thr Glu Lys Leu Pro Cys Met Gin Gly Arg 
300 305 310 

aag ace aca gca tea age caa gaa aat cat age agt eag aca gac aae 1132 
Lys Thr Thr Ala Ser Ser Gin Glu Asn His Ser Ser Gin Thr Asp Asn 
315 320 325 

ata ace tta ggc tga caactgtaea tagggttaac ttctatttat tgatgagact 1187 

He Thr Leu Gly 

330 

tecgtagata atgtggaaat caaatttaac caagaaaaaa agattggaac aaatgctctc 1247 

ttacatttta tttatcctgg tgtceaggaa aagattatat taaatttaaa tccacataga 1307 

tetattcata agctgaatga accattacet aagagaatgc aacaggatae eaatggeeac 1367 

tagaggeata ttccttcttc tttttttttt gttaaattte aagagcattc actttaeatt 1427 

tggaaagact aaggggaacg gttatcctac aaacctccct tcaacacctt ttacatt 1484 

<210> 8 
<211> 333 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Asn Thr Thr Val Met Gln Gly Phe Asn Arg Ser Glu Arg Cys Pro 
1 5 10 15 

Arg Asp Thr Arg Ile Val Gln Leu Val Phe Pro Ala Leu Tyr Thr Val 
20 25 30 

Val Phe Leu Thr Gly Ile Leu Leu Asn Thr Leu Ala Leu Trp Val Phe 
35 40 45 

Val His Ile Pro Ser Ser Ser Thr Phe Ile Ile Tyr Leu Lys Asn Thr 
50 55 60 

Leu Val Ala Asp Leu Ile Met Thr Leu Met Leu Pro Phe Lys Ile Leu 
65 70 75 80 

Ser Asp Ser His Leu Ala Pro Trp Gln Leu Arg Ala Phe Val Cys Arg 
85 90 95 

Phe Ser Ser Val Ile Phe Tyr Glu Thr Met Tyr Val Gly Ile Val Leu 
100 105 110 

Leu Gly Leu Ile Ala Phe Asp Arg Phe Leu Lys Ile Ile Arg Pro Leu 
115 120 125 

Arg Asn Ile Phe Leu Lys Lys Pro Val Phe Ala Lys Thr Val Ser Ile 
130 135 140 

Phe Ile Trp Val Phe Leu Val Phe Ile Ser Leu Pro Asn Met Ile Leu 
145 150 155 160 

Ser Asn Lys Glu Ala Thr Pro Ser Ser Val Lys Lys Cys Ala Ser Leu 
165 170 175 

Lys Gly Pro Leu Gly Leu Lys Trp His Gln Met Val Asn Asn Ile Cys 
180 185 190 

Gln Phe Ile Phe Trp Thr Gly Phe Ile Leu Met Leu Val Phe Tyr Val 
195 200 205 

Val Ile Ala Lys Lys Val Tyr Asp Ser Tyr Arg Lys Ser Lys Ser Lys 
210 215 220 

Asp Arg Lys Asn Asn Lys Lys Leu Glu Gly Lys Val Phe Val Val Val 
225 230 235 240 

Ala Val Phe Phe Val Cys Phe Ala Pro Phe His Phe Ala Arg Val Pro 
245 250 255 

Tyr Thr His Ser Gln Thr Asn Asn Lys Thr Asp Cys Arg Leu Gln Asn 
260 265 270 

Gln Leu Phe Ile Ala Lys Glu Thr Thr Leu Phe Leu Ala Ala Thr Asn 
275 280 285 

Ile Cys Met Asp Pro Leu Ile Tyr Ile Phe Leu Cys Lys Lys Phe Thr 
290 295 300 

Glu Lys Leu Pro Cys Met Gln Gly Arg Lys Thr Thr Ala Ser Ser Gln 
305 310 315 320 

Glu Asn His Ser Ser Gln Thr Asp Asn Ile Thr Leu Gly 
325 330 



<210> 9 
<211> 957 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(954) 

<400> 9 

atg atg gtg gat ccc aat ggc aat gaa tec agt get aca tac ttc ate 48 

Met Met Val Asp Pro Asn Gly Asn Glu Ser Ser Ala Thr Tyr Phe He 

15 10 15 



eta ata ggc etc ect ggt tta gaa gag get cag ttc tgg Ltg gcc ttc 96 
Leu Ile Gly Leu Pro Gly Leu Glu Glu Ala Gln Phe Trp Leu Ala Phe 
20 25 30 

cca ttg tgc tec etc tac ctt att get gtg eta ggt aac ttg aca ate 144 
Pro Leu Cys Ser Leu Tyr Leu He Ala Val Leu Gly Asn Leu Thr He 
35 40 45 



ate tac att gtg egg act gag cae age ctg cat gag ccc atg tat ata 192 

He Tyr He Val Arg Thr Glu His Ser Leu His Glu Pro Met Tyr He 
50 55 60 

ttt ctt tgc atg ctt tea ggc att gac ate etc ate tec acc tea tec 240 

Phe Leu Cys Met Leu Ser Gly He Asp He Leu He Ser Thr Ser Ser 
65 70 75 80 

atg ccc aaa atg ctg gcc ate ttc tgg ttc aat tec act acc ate cag 288 

Met Pro Lys Met Leu Ala He Phe Trp Phe Asn Ser Thr Thr He Gin 
85 90 95 

ttt gat get tgt ctg eta cag atg ttt gcc ate eac tec tta tct ggc 336 

Phe Asp Ala Cys Leu Leu Gin Met Phe Ala He His Ser Leu Ser Gly 
100 105 110 

atg gaa tec aca gtg ctg ctg gcc atg get ttt gac cgc tat gtg gee 384 

Met Glu Ser Thr Val Leu Leu Ala Met Ala Phe Asp Arg Tyr Val Ala 
115 120 125 



ate tgt eac cca etg egc eat gcc aca gta ctt acg ttg ect cgt gtc 432 
He Cys His Pro Leu Arg His Ala Thr Val Leu Thr Leu Pro Arg Val 
130 135 140 



acc aaa att ggt gtg gct gct gtg gtg cgg ggg gct gca ctg atg gca 480 
Thr Lys Ile Gly Val Ala Ala Val Val Arg Gly Ala Ala Leu Met Ala 
145 150 155 160 

ccc ctt cct gtc ttc atc aag cag ctg ccc ttc tgc cgc tcc aat atc 528 
Pro Leu Pro Val Phe Ile Lys Gln Leu Pro Phe Cys Arg Ser Asn Ile 
165 170 175 

ctt tcc cat tcc tac tgc cta cac caa gat gtc atg aag ctg gcc tgt 576 

Leu Ser His Ser Tyr Cys Leu His Gin Asp Val Met Lys Leu Ala Cys 

180 185 190 

gat gat atc cgg gtc aat gtc gtc tat ggc ctt atc gtc atc atc tcc 624 

Asp Asp Ile Arg Val Asn Val Val Tyr Gly Leu Ile Val Ile Ile Ser 
195 200 205 

gcc att ggc ctg gac tea ctt etc ate tec ttc tea tat ctg ctt att 672 

Ala lie Gly Leu Asp Ser Leu Leu lie Ser Phe Ser Tyr Leu Leu He 
210 215 220 

ctt aag act gtg ttg ggc ttg aca cgt gaa gee cag gcc aag gea ttt 720 

Leu Lys Thr Val Leu Gly Leu Thr Arg Glu Ala Gin Ala Lys Ala Phe 

225 230 235 240 

ggc act tgc gtc tet cat gtg tgt get gtg ttc ata ttc tat gta cct 768 

Gly Thr Cys Val Ser His Val Cys Ala Val Phe He Phe Tyr Val Pro 
245 250 255 

ttc att gga ttg tec atg gtg eat cge ttt age aag egg cgt gac tet 816 

Phe He Gly Leu Ser Met Val His Arg Phe Ser Lys Arg Arg Asp Ser 

260 265 270 

ccg ctg ccc gtc ate ttg gee aat ate tat ctg ctg gtt cct cct gtg 864 

Pro Leu Pro Val lie Leu Ala Asn He Tyr Leu Leu Val Pro Pro Val 
275 280 285 

etc aac cca att gtc tat gga gtg aag aca aag gag att ega cag cge 912 

Leu Asn Pro He Val Tyr Gly Val Lys Thr Lys Glu He Arg Gin Arg 
290 295 300 

ate ctt ega ctt ttc cat gtg gcc aca cac get tea gag ccc tag 957 

He Leu Arg Leu Phe His Val Ala Thr His Ala Ser Glu Pro 

305 310 315 



<210> 10 

<211> 318 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Met Val Asp Pro Asn Gly Asn Glu Ser Ser Ala Thr Tyr Phe Ile 
1 5 10 15 

Leu Ile Gly Leu Pro Gly Leu Glu Glu Ala Gln Phe Trp Leu Ala Phe 
20 25 30 

Pro Leu Cys Ser Leu Tyr Leu Ile Ala Val Leu Gly Asn Leu Thr Ile 
35 40 45 

Ile Tyr Ile Val Arg Thr Glu His Ser Leu His Glu Pro Met Tyr Ile 
50 55 60 

Phe Leu Cys Met Leu Ser Gly Ile Asp Ile Leu Ile Ser Thr Ser Ser 
65 70 75 80 

Met Pro Lys Met Leu Ala Ile Phe Trp Phe Asn Ser Thr Thr Ile Gln 
85 90 95 

Phe Asp Ala Cys Leu Leu Gln Met Phe Ala Ile His Ser Leu Ser Gly 
100 105 110 

Met Glu Ser Thr Val Leu Leu Ala Met Ala Phe Asp Arg Tyr Val Ala 
115 120 125 

Ile Cys His Pro Leu Arg His Ala Thr Val Leu Thr Leu Pro Arg Val 
130 135 140 

Thr Lys Ile Gly Val Ala Ala Val Val Arg Gly Ala Ala Leu Met Ala 
145 150 155 160 

Pro Leu Pro Val Phe Ile Lys Gln Leu Pro Phe Cys Arg Ser Asn Ile 
165 170 175 

Leu Ser His Ser Tyr Cys Leu His Gln Asp Val Met Lys Leu Ala Cys 
180 185 190 

Asp Asp Ile Arg Val Asn Val Val Tyr Gly Leu Ile Val Ile Ile Ser 
195 200 205 

Ala Ile Gly Leu Asp Ser Leu Leu Ile Ser Phe Ser Tyr Leu Leu Ile 
210 215 220 

Leu Lys Thr Val Leu Gly Leu Thr Arg Glu Ala Gln Ala Lys Ala Phe 
225 230 235 240 

Gly Thr Cys Val Ser His Val Cys Ala Val Phe Ile Phe Tyr Val Pro 
245 250 255 

Phe Ile Gly Leu Ser Met Val His Arg Phe Ser Lys Arg Arg Asp Ser 
260 265 270 

Pro Leu Pro Val Ile Leu Ala Asn Ile Tyr Leu Leu Val Pro Pro Val 
275 280 285 

Leu Asn Pro Ile Val Tyr Gly Val Lys Thr Lys Glu Ile Arg Gln Arg 
290 295 300 

Ile Leu Arg Leu Phe His Val Ala Thr His Ala Ser Glu Pro 
305 310 315 

<210> 11 
<211> 995 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(921)

<400> 11 

atg gaa agc gag aac aga aga gtg ata aga gaa ttc atc ctc ctt ggt 48
Met Glu Ser Glu Asn Arg Arg Val Ile Arg Glu Phe Ile Leu Leu Gly
1 5 10 15

ctg acc cag tct caa gat att cag ctc ctg gtc ttt gtg cta gtt tta 96
Leu Thr Gln Ser Gln Asp Ile Gln Leu Leu Val Phe Val Leu Val Leu
20 25 30

ata ttc tac ttc atc atc ctc cct gga aat ttt ctc att att ttc acc 144
Ile Phe Tyr Phe Ile Ile Leu Pro Gly Asn Phe Leu Ile Ile Phe Thr
35 40 45

ata aag tca gac cct ggg ctc aca gcc ccc ctc tat ttc ttt ctg ggc 192
Ile Lys Ser Asp Pro Gly Leu Thr Ala Pro Leu Tyr Phe Phe Leu Gly
50 55 60

aac ttg gcc ttc ctg gat gca tcc tac tcc ttc att gtg gct ccc cgg 240
Asn Leu Ala Phe Leu Asp Ala Ser Tyr Ser Phe Ile Val Ala Pro Arg
65 70 75 80

atg ttg gtg gac ttc ctc tct gcg aag aag ata atc tcc tac aga ggc 288
Met Leu Val Asp Phe Leu Ser Ala Lys Lys Ile Ile Ser Tyr Arg Gly
85 90 95

tgc atc act cag ctc ttt ttc ttg cac ttc ctt gga gga ggg gag gga 336
Cys Ile Thr Gln Leu Phe Phe Leu His Phe Leu Gly Gly Gly Glu Gly
100 105 110

tta ctc ctt gtt gtg atg gcc ttt gac cgc tac atc gcc atc tgc cgg 384
Leu Leu Leu Val Val Met Ala Phe Asp Arg Tyr Ile Ala Ile Cys Arg
115 120 125

cct ctg cac tat cct act gtc atg aac cct aga acc tgc tat gca atg 432
Pro Leu His Tyr Pro Thr Val Met Asn Pro Arg Thr Cys Tyr Ala Met
130 135 140

atg ttg gct ctg tgg ctt ggg ggt ttt gtc cac tcc att atc cag gtg 480
Met Leu Ala Leu Trp Leu Gly Gly Phe Val His Ser Ile Ile Gln Val
145 150 155 160

gtc ctc atc ctc cgc ttg cct ttt tgt ggc cca aac cag ctg gac aac 528
Val Leu Ile Leu Arg Leu Pro Phe Cys Gly Pro Asn Gln Leu Asp Asn
165 170 175

ttc ttc tgt gat gtc cca cag gtc atc aag ctg gcc tgc acc gac aca 576
Phe Phe Cys Asp Val Pro Gln Val Ile Lys Leu Ala Cys Thr Asp Thr
180 185 190

ttt gtg gtg gag ctt ctg atg gtc ttc aac agt ggc ctg atg aca ctc 624
Phe Val Val Glu Leu Leu Met Val Phe Asn Ser Gly Leu Met Thr Leu
195 200 205

ctg tgc ttt ctg ggg ctt ctg gcc tcc tat gca gtc att ctt tgt cgc 672
Leu Cys Phe Leu Gly Leu Leu Ala Ser Tyr Ala Val Ile Leu Cys Arg
210 215 220

ata cga ggg tct tct tct gag gca aaa aac aag gcc atg tcc acg tgc 720
Ile Arg Gly Ser Ser Ser Glu Ala Lys Asn Lys Ala Met Ser Thr Cys
225 230 235 240

atc acc cat atc att gtt ata ttc ttc atg ttt gga cct ggc atc ttc 768
Ile Thr His Ile Ile Val Ile Phe Phe Met Phe Gly Pro Gly Ile Phe
245 250 255

atc tac acg cgc ccc ttc agg gct ttc cca gct gac aag gtg gtt tct 816
Ile Tyr Thr Arg Pro Phe Arg Ala Phe Pro Ala Asp Lys Val Val Ser
260 265 270

ctc ttc cac aca gtg att ttt cct ttg ttg aat cct gtc att tat acc 864
Leu Phe His Thr Val Ile Phe Pro Leu Leu Asn Pro Val Ile Tyr Thr
275 280 285

ctt cgc aac cag gaa gtg aaa gct tcc atg aaa aag gtg ttt aat aag 912
Leu Arg Asn Gln Glu Val Lys Ala Ser Met Lys Lys Val Phe Asn Lys
290 295 300

cac ata gcc tgaaaaaggg cgcaaaaaaa aaaagaataa aaatagactg 961
His Ile Ala
305

tagaatttct aaaaaaaaaa aaaaaaaaaa aaaa 995



<210> 12 
<211> 307 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Glu Ser Glu Asn Arg Arg Val Ile Arg Glu Phe Ile Leu Leu Gly
1 5 10 15

Leu Thr Gln Ser Gln Asp Ile Gln Leu Leu Val Phe Val Leu Val Leu
20 25 30

Ile Phe Tyr Phe Ile Ile Leu Pro Gly Asn Phe Leu Ile Ile Phe Thr
35 40 45

Ile Lys Ser Asp Pro Gly Leu Thr Ala Pro Leu Tyr Phe Phe Leu Gly
50 55 60

Asn Leu Ala Phe Leu Asp Ala Ser Tyr Ser Phe Ile Val Ala Pro Arg
65 70 75 80

Met Leu Val Asp Phe Leu Ser Ala Lys Lys Ile Ile Ser Tyr Arg Gly
85 90 95

Cys Ile Thr Gln Leu Phe Phe Leu His Phe Leu Gly Gly Gly Glu Gly
100 105 110

Leu Leu Leu Val Val Met Ala Phe Asp Arg Tyr Ile Ala Ile Cys Arg
115 120 125

Pro Leu His Tyr Pro Thr Val Met Asn Pro Arg Thr Cys Tyr Ala Met
130 135 140

Met Leu Ala Leu Trp Leu Gly Gly Phe Val His Ser Ile Ile Gln Val
145 150 155 160

Val Leu Ile Leu Arg Leu Pro Phe Cys Gly Pro Asn Gln Leu Asp Asn
165 170 175

Phe Phe Cys Asp Val Pro Gln Val Ile Lys Leu Ala Cys Thr Asp Thr
180 185 190

Phe Val Val Glu Leu Leu Met Val Phe Asn Ser Gly Leu Met Thr Leu
195 200 205

Leu Cys Phe Leu Gly Leu Leu Ala Ser Tyr Ala Val Ile Leu Cys Arg
210 215 220

Ile Arg Gly Ser Ser Ser Glu Ala Lys Asn Lys Ala Met Ser Thr Cys
225 230 235 240

Ile Thr His Ile Ile Val Ile Phe Phe Met Phe Gly Pro Gly Ile Phe
245 250 255

Ile Tyr Thr Arg Pro Phe Arg Ala Phe Pro Ala Asp Lys Val Val Ser
260 265 270

Leu Phe His Thr Val Ile Phe Pro Leu Leu Asn Pro Val Ile Tyr Thr
275 280 285

Leu Arg Asn Gln Glu Val Lys Ala Ser Met Lys Lys Val Phe Asn Lys
290 295 300

His Ile Ala
305 

<210> 13 
<211> 1380 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (266)..(1375)
<220>

<221> misc_feature
<222> (32)

<223> n = A or C or G or T
<220>

<221> misc_feature
<222> (55)

<223> n = A or C or G or T
<220>

<221> misc_feature
<222> (74)

<223> n = A or C or G or T
<400> 13 

tgcttcccca taaggtaaca gctttgttag cnctgtctga catcattgct tgttnactta 60 

agaactgata ggtntttttt tttttttttt ttcagatatt ctgatggcaa aacaagtgga 120 

agaaaagagg aagcatgact gcagatcaga tcagttctct ttgtggatta tattttcagt 180 

aaaatgtatg gatctatctt ttccttgttc ttatatctag atcatgagac ttgactgagg 240 

ctgtatcctt atcctccatc catct atg gcg aac tat agc cat gca gct gac 292
Met Ala Asn Tyr Ser His Ala Ala Asp
1 5

aac att ttg caa aat ctc tcg cct cta aca gcc ttt ctg aaa ctg act 340
Asn Ile Leu Gln Asn Leu Ser Pro Leu Thr Ala Phe Leu Lys Leu Thr
10 15 20 25

tcc ttg ggt ttc ata ata gga gtc agc gtg gtg ggc aac ctc ctg atc 388
Ser Leu Gly Phe Ile Ile Gly Val Ser Val Val Gly Asn Leu Leu Ile
30 35 40

tcc att ttg cta gtg aaa gat aag acc ttg cat aga gca cct tac tac 436
Ser Ile Leu Leu Val Lys Asp Lys Thr Leu His Arg Ala Pro Tyr Tyr
45 50 55

ttc ctg ttg gat ctt tgc tgt tca gat atc ctc aga tct gca att tgt 484
Phe Leu Leu Asp Leu Cys Cys Ser Asp Ile Leu Arg Ser Ala Ile Cys
60 65 70

ttc cca ttt gtg ttc aac tct gtc aaa aat ggt tct acc tgg act tat 532
Phe Pro Phe Val Phe Asn Ser Val Lys Asn Gly Ser Thr Trp Thr Tyr
75 80 85

ggg act ctg act tgc aaa gtg att gcc ttt ctg ggg gtt ttg tcc tgt 580
Gly Thr Leu Thr Cys Lys Val Ile Ala Phe Leu Gly Val Leu Ser Cys
90 95 100 105

ttc cac act gct ttc atg ctc ttc tgc atc agt gtc acc aga tat tta 628
Phe His Thr Ala Phe Met Leu Phe Cys Ile Ser Val Thr Arg Tyr Leu
110 115 120

gct atc gcc cat cac cgc ttc tat aca aag agg ctg acc ttt tgg acg 676
Ala Ile Ala His His Arg Phe Tyr Thr Lys Arg Leu Thr Phe Trp Thr
125 130 135

tgt ctg gct gtg atc tgt atg gtg tgg act ctg tct gtg gcc atg gca 724
Cys Leu Ala Val Ile Cys Met Val Trp Thr Leu Ser Val Ala Met Ala
140 145 150

ttt ccc ccg gtt tta gac gtg ggc act tac tca ttc att agg gag gaa 772
Phe Pro Pro Val Leu Asp Val Gly Thr Tyr Ser Phe Ile Arg Glu Glu
155 160 165

gat caa tgc acc ttc caa cac cgc tcc ttc agg gct aat gat tcc tta 820
Asp Gln Cys Thr Phe Gln His Arg Ser Phe Arg Ala Asn Asp Ser Leu
170 175 180 185

gga ttt atg ctg ctt ctt gct ctc atc ctc cta gcc aca cag ctt gtc 868
Gly Phe Met Leu Leu Leu Ala Leu Ile Leu Leu Ala Thr Gln Leu Val
190 195 200

tac ctc aag ctg ata ttt ttc gtc cac gat cga aga aaa atg aag cca 916
Tyr Leu Lys Leu Ile Phe Phe Val His Asp Arg Arg Lys Met Lys Pro
205 210 215

gtc cag ttt gta gca gca gtc agc cag aac tgg act ttt cat ggt cct 964
Val Gln Phe Val Ala Ala Val Ser Gln Asn Trp Thr Phe His Gly Pro
220 225 230

gga gcc agt ggc cag gca gct gcc aat tgg cta gca gga ttt gga agg 1012
Gly Ala Ser Gly Gln Ala Ala Ala Asn Trp Leu Ala Gly Phe Gly Arg
235 240 245

ggt ccc aca cca ccc acc ttg ctg ggc atc agg caa aat gca aac acc 1060
Gly Pro Thr Pro Pro Thr Leu Leu Gly Ile Arg Gln Asn Ala Asn Thr
250 255 260 265

aca ggc aga aga agg cta ttg gtc tta gac gag ttc aaa atg gag aaa 1108
Thr Gly Arg Arg Arg Leu Leu Val Leu Asp Glu Phe Lys Met Glu Lys
270 275 280

aga atc agc aga atg ttc tat ata atg act ttt ctg ttt cta acc ttg 1156
Arg Ile Ser Arg Met Phe Tyr Ile Met Thr Phe Leu Phe Leu Thr Leu
285 290 295

tgg ggc ccc tac ctg gtg gcc tgt tat tgg aga gtt ttt gca aga ggg 1204
Trp Gly Pro Tyr Leu Val Ala Cys Tyr Trp Arg Val Phe Ala Arg Gly
300 305 310

cct gta gta cca ggg gga ttt cta aca gct gct gtc tgg atg agt ttt 1252
Pro Val Val Pro Gly Gly Phe Leu Thr Ala Ala Val Trp Met Ser Phe
315 320 325

gcc caa gca gga atc aat cct ttt gtc tgc att ttc tca aac agg gag 1300
Ala Gln Ala Gly Ile Asn Pro Phe Val Cys Ile Phe Ser Asn Arg Glu
330 335 340 345

ctg agg cgc tgt ttc agc aca acc ctt ctt tac tgc aga aaa tcc agg 1348
Leu Arg Arg Cys Phe Ser Thr Thr Leu Leu Tyr Cys Arg Lys Ser Arg
350 355 360

tta cca agg gaa cct tac tgt gtt ata tgagg 1380
Leu Pro Arg Glu Pro Tyr Cys Val Ile
365 370



<210> 14 
<211> 370 
<212> PRT 

<213> Homo sapiens 



<400> 14 
Met Ala Asn Tyr Ser His Ala Ala Asp Asn Ile Leu Gln Asn Leu Ser
1 5 10 15

Pro Leu Thr Ala Phe Leu Lys Leu Thr Ser Leu Gly Phe Ile Ile Gly
20 25 30

Val Ser Val Val Gly Asn Leu Leu Ile Ser Ile Leu Leu Val Lys Asp
35 40 45

Lys Thr Leu His Arg Ala Pro Tyr Tyr Phe Leu Leu Asp Leu Cys Cys
50 55 60

Ser Asp Ile Leu Arg Ser Ala Ile Cys Phe Pro Phe Val Phe Asn Ser
65 70 75 80

Val Lys Asn Gly Ser Thr Trp Thr Tyr Gly Thr Leu Thr Cys Lys Val
85 90 95

Ile Ala Phe Leu Gly Val Leu Ser Cys Phe His Thr Ala Phe Met Leu
100 105 110

Phe Cys Ile Ser Val Thr Arg Tyr Leu Ala Ile Ala His His Arg Phe
115 120 125

Tyr Thr Lys Arg Leu Thr Phe Trp Thr Cys Leu Ala Val Ile Cys Met
130 135 140

Val Trp Thr Leu Ser Val Ala Met Ala Phe Pro Pro Val Leu Asp Val
145 150 155 160

Gly Thr Tyr Ser Phe Ile Arg Glu Glu Asp Gln Cys Thr Phe Gln His
165 170 175

Arg Ser Phe Arg Ala Asn Asp Ser Leu Gly Phe Met Leu Leu Leu Ala
180 185 190

Leu Ile Leu Leu Ala Thr Gln Leu Val Tyr Leu Lys Leu Ile Phe Phe
195 200 205

Val His Asp Arg Arg Lys Met Lys Pro Val Gln Phe Val Ala Ala Val
210 215 220

Ser Gln Asn Trp Thr Phe His Gly Pro Gly Ala Ser Gly Gln Ala Ala
225 230 235 240

Ala Asn Trp Leu Ala Gly Phe Gly Arg Gly Pro Thr Pro Pro Thr Leu
245 250 255

Leu Gly Ile Arg Gln Asn Ala Asn Thr Thr Gly Arg Arg Arg Leu Leu
260 265 270

Val Leu Asp Glu Phe Lys Met Glu Lys Arg Ile Ser Arg Met Phe Tyr
275 280 285

Ile Met Thr Phe Leu Phe Leu Thr Leu Trp Gly Pro Tyr Leu Val Ala
290 295 300

Cys Tyr Trp Arg Val Phe Ala Arg Gly Pro Val Val Pro Gly Gly Phe
305 310 315 320

Leu Thr Ala Ala Val Trp Met Ser Phe Ala Gln Ala Gly Ile Asn Pro
325 330 335

Phe Val Cys Ile Phe Ser Asn Arg Glu Leu Arg Arg Cys Phe Ser Thr
340 345 350

Thr Leu Leu Tyr Cys Arg Lys Ser Arg Leu Pro Arg Glu Pro Tyr Cys
355 360 365

Val Ile
370



<210> 15 

<211> 1191 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(1188)



<400> 15 

atg ttt aga cct ctt gtg aat ctc tct cac ata tat ttt aag aaa ttc 48
Met Phe Arg Pro Leu Val Asn Leu Ser His Ile Tyr Phe Lys Lys Phe
1 5 10 15

cag tac tgt ggg tat gca cca cat gtt cgc agc tgt aaa cca aac act 96
Gln Tyr Cys Gly Tyr Ala Pro His Val Arg Ser Cys Lys Pro Asn Thr
20 25 30

gat gga att tca tct cta gag aat ctc ttg gca agc att att cag aga 144
Asp Gly Ile Ser Ser Leu Glu Asn Leu Leu Ala Ser Ile Ile Gln Arg
35 40 45

gta ttt gtc tgg gtt gta tct gca gtt acc tgc ttt gga aac att ttt 192
Val Phe Val Trp Val Val Ser Ala Val Thr Cys Phe Gly Asn Ile Phe
50 55 60

gtc att tgc atg cga cct tat atc agg tct gag aac aag ctg tat gcc 240
Val Ile Cys Met Arg Pro Tyr Ile Arg Ser Glu Asn Lys Leu Tyr Ala
65 70 75 80

atg tca atc att tct ctc tgc tgt gcc gac tgc tta atg gga ata tat 288
Met Ser Ile Ile Ser Leu Cys Cys Ala Asp Cys Leu Met Gly Ile Tyr
85 90 95

tta ttc gtg atc gga ggc ttt gac cta aag ttt cgt gga gaa tac aat 336
Leu Phe Val Ile Gly Gly Phe Asp Leu Lys Phe Arg Gly Glu Tyr Asn
100 105 110

aag cat gcg cag ctg tgg atg gag agt act cat tgt cag ctt gta gga 384
Lys His Ala Gln Leu Trp Met Glu Ser Thr His Cys Gln Leu Val Gly
115 120 125

tct ttg gcc att ctg tcc aca gaa gta tca gtt tta ctg tta aca ttt 432
Ser Leu Ala Ile Leu Ser Thr Glu Val Ser Val Leu Leu Leu Thr Phe
130 135 140

ctg aca ttg gaa aaa tac atc tgc att gtc tat cct ttt aga tgt gtg 480
Leu Thr Leu Glu Lys Tyr Ile Cys Ile Val Tyr Pro Phe Arg Cys Val
145 150 155 160

aga cct gga aaa tgc aga aca att aca gtt ctg att ctc att tgg att 528
Arg Pro Gly Lys Cys Arg Thr Ile Thr Val Leu Ile Leu Ile Trp Ile
165 170 175

act ggt ttt ata gtg gct ttc att cca ttg agc aat aag gaa ttt ttc 576
Thr Gly Phe Ile Val Ala Phe Ile Pro Leu Ser Asn Lys Glu Phe Phe
180 185 190

aaa aac tac tat ggc acc aat gga gta tgc ttc cct ctt cat tca gaa 624
Lys Asn Tyr Tyr Gly Thr Asn Gly Val Cys Phe Pro Leu His Ser Glu
195 200 205

gat aca gaa agt att gga gcc cag att tat tca gtg gca att ttt ctt 672
Asp Thr Glu Ser Ile Gly Ala Gln Ile Tyr Ser Val Ala Ile Phe Leu
210 215 220

ggt att aat ttg gcc gca ttt atc atc ata gtt ttt tcc tat gga agc 720
Gly Ile Asn Leu Ala Ala Phe Ile Ile Ile Val Phe Ser Tyr Gly Ser
225 230 235 240

atg ttt tat agt gtt cat caa agt gcc ata aca gca act gaa ata cgg 768
Met Phe Tyr Ser Val His Gln Ser Ala Ile Thr Ala Thr Glu Ile Arg
245 250 255

aat caa gtt aaa aaa gag atg atc ctt gcc aaa cgt ttt ttc ttt ata 816
Asn Gln Val Lys Lys Glu Met Ile Leu Ala Lys Arg Phe Phe Phe Ile
260 265 270

gta ttt act gat gca tta tgc tgg ata ccc att ttt gta gtg aaa ttt 864
Val Phe Thr Asp Ala Leu Cys Trp Ile Pro Ile Phe Val Val Lys Phe
275 280 285

ctt tca ctg ctt cag gta gaa ata cca ggt acc ata acc tct tgg gta 912
Leu Ser Leu Leu Gln Val Glu Ile Pro Gly Thr Ile Thr Ser Trp Val
290 295 300

gtg att ttt att ctg ccc att aac agt gct ttg aac cca att ctc tat 960
Val Ile Phe Ile Leu Pro Ile Asn Ser Ala Leu Asn Pro Ile Leu Tyr
305 310 315 320

act ctg acc aca aga cca ttt aaa gaa atg att cat cgg ttt tgg tat 1008
Thr Leu Thr Thr Arg Pro Phe Lys Glu Met Ile His Arg Phe Trp Tyr
325 330 335

aac tac aga caa aga aaa tct atg gac agc aaa ggt cag aaa aca tat 1056
Asn Tyr Arg Gln Arg Lys Ser Met Asp Ser Lys Gly Gln Lys Thr Tyr
340 345 350

gct cca tca ttc atc tgg gtg gaa atg tgg cca ctg cag gag atg cca 1104
Ala Pro Ser Phe Ile Trp Val Glu Met Trp Pro Leu Gln Glu Met Pro
355 360 365

cct gag tta atg aag ccg gac ctt ttc aca tac ccc tgt gaa atg tca 1152
Pro Glu Leu Met Lys Pro Asp Leu Phe Thr Tyr Pro Cys Glu Met Ser
370 375 380

ctg att tct caa tca acg aga ctc aat tcc tat tca tga 1191
Leu Ile Ser Gln Ser Thr Arg Leu Asn Ser Tyr Ser
385 390 395



<210> 16 
<211> 396 
<212> PRT 

<213> Homo sapiens 
<400> 16 

Met Phe Arg Pro Leu Val Asn Leu Ser His Ile Tyr Phe Lys Lys Phe
1 5 10 15

Gln Tyr Cys Gly Tyr Ala Pro His Val Arg Ser Cys Lys Pro Asn Thr
20 25 30

Asp Gly Ile Ser Ser Leu Glu Asn Leu Leu Ala Ser Ile Ile Gln Arg
35 40 45

Val Phe Val Trp Val Val Ser Ala Val Thr Cys Phe Gly Asn Ile Phe
50 55 60

Val Ile Cys Met Arg Pro Tyr Ile Arg Ser Glu Asn Lys Leu Tyr Ala
65 70 75 80

Met Ser Ile Ile Ser Leu Cys Cys Ala Asp Cys Leu Met Gly Ile Tyr
85 90 95

Leu Phe Val Ile Gly Gly Phe Asp Leu Lys Phe Arg Gly Glu Tyr Asn
100 105 110

Lys His Ala Gln Leu Trp Met Glu Ser Thr His Cys Gln Leu Val Gly
115 120 125

Ser Leu Ala Ile Leu Ser Thr Glu Val Ser Val Leu Leu Leu Thr Phe
130 135 140

Leu Thr Leu Glu Lys Tyr Ile Cys Ile Val Tyr Pro Phe Arg Cys Val
145 150 155 160

Arg Pro Gly Lys Cys Arg Thr Ile Thr Val Leu Ile Leu Ile Trp Ile
165 170 175

Thr Gly Phe Ile Val Ala Phe Ile Pro Leu Ser Asn Lys Glu Phe Phe
180 185 190

Lys Asn Tyr Tyr Gly Thr Asn Gly Val Cys Phe Pro Leu His Ser Glu
195 200 205

Asp Thr Glu Ser Ile Gly Ala Gln Ile Tyr Ser Val Ala Ile Phe Leu
210 215 220

Gly Ile Asn Leu Ala Ala Phe Ile Ile Ile Val Phe Ser Tyr Gly Ser
225 230 235 240

Met Phe Tyr Ser Val His Gln Ser Ala Ile Thr Ala Thr Glu Ile Arg
245 250 255

Asn Gln Val Lys Lys Glu Met Ile Leu Ala Lys Arg Phe Phe Phe Ile
260 265 270

Val Phe Thr Asp Ala Leu Cys Trp Ile Pro Ile Phe Val Val Lys Phe
275 280 285

Leu Ser Leu Leu Gln Val Glu Ile Pro Gly Thr Ile Thr Ser Trp Val
290 295 300

Val Ile Phe Ile Leu Pro Ile Asn Ser Ala Leu Asn Pro Ile Leu Tyr
305 310 315 320

Thr Leu Thr Thr Arg Pro Phe Lys Glu Met Ile His Arg Phe Trp Tyr
325 330 335

Asn Tyr Arg Gln Arg Lys Ser Met Asp Ser Lys Gly Gln Lys Thr Tyr
340 345 350

Ala Pro Ser Phe Ile Trp Val Glu Met Trp Pro Leu Gln Glu Met Pro
355 360 365

Pro Glu Leu Met Lys Pro Asp Leu Phe Thr Tyr Pro Cys Glu Met Ser
370 375 380

Leu Ile Ser Gln Ser Thr Arg Leu Asn Ser Tyr Ser
385 390 395 
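
As a cross-check of the listing, the first sixteen codons of the CDS of SEQ ID NO: 15 can be translated and compared with the opening residues of SEQ ID NO: 16. The following is a minimal sketch only. It assumes the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical helpers introduced for illustration; they are not part of the application.

```python
from itertools import product

# Assumption: standard genetic code, bases ordered t, c, a, g (NCBI table 1 layout).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate DNA (spaces ignored) into three-letter residues.

    Translation stops at the first stop codon; a trailing partial
    codon is ignored.
    """
    seq = "".join(dna.lower().split())
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


# Positions 1-48 of SEQ ID NO: 15 as given in the listing.
first_row = "atg ttt aga cct ctt gtg aat ctc tct cac ata tat ttt aag aaa ttc"
print(" ".join(translate(first_row)))
```

The printed residues can be compared by eye with the first row of SEQ ID NO: 16; the same helper could be applied to any other CDS row in the listing.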

<210> 17 

<211> 1164 

<212> DNA

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (13)..(1089)

<400> 17 

cacaactgaa ga atg ggg ttc aac ttg acg ctt gca aaa tta cca aat aac 51
Met Gly Phe Asn Leu Thr Leu Ala Lys Leu Pro Asn Asn
1 5 10

gag ctg cac ggc caa gag agt cac aat tca ggc aac agg agc gac ggg 99
Glu Leu His Gly Gln Glu Ser His Asn Ser Gly Asn Arg Ser Asp Gly
15 20 25

cca gga aag aac acc acc ctt cac aat gaa ttt gac aca att gtc ttg 147
Pro Gly Lys Asn Thr Thr Leu His Asn Glu Phe Asp Thr Ile Val Leu
30 35 40 45

cca gtg ctt tat ctc att ata ttt gtg gca agc atc ttg ctg aat ggt 195
Pro Val Leu Tyr Leu Ile Ile Phe Val Ala Ser Ile Leu Leu Asn Gly
50 55 60

tta gca gtg tgg atc ttc ttc cac att agg aat aaa acc agc ttc ata 243
Leu Ala Val Trp Ile Phe Phe His Ile Arg Asn Lys Thr Ser Phe Ile
65 70 75

ttc tat ctc aaa aac ata gtg gtt gca gac ctc ata atg acg ctg aca 291
Phe Tyr Leu Lys Asn Ile Val Val Ala Asp Leu Ile Met Thr Leu Thr
80 85 90

ttt cca ttt cga ata gtc cat gat gca gga ttt gga cct tgg tac ttc 339
Phe Pro Phe Arg Ile Val His Asp Ala Gly Phe Gly Pro Trp Tyr Phe
95 100 105

aag ttt att ctc tgc aga tac act tca gtt ttg ttt tat gca aac atg 387
Lys Phe Ile Leu Cys Arg Tyr Thr Ser Val Leu Phe Tyr Ala Asn Met
110 115 120 125

tat act tcc atc gtg ttc ctt ggg ctg ata agc att gat cgc tat ctg 435
Tyr Thr Ser Ile Val Phe Leu Gly Leu Ile Ser Ile Asp Arg Tyr Leu
130 135 140

aag gtg gtc aag cca ttt ggg gac tct cgg atg tac agc ata acc ttc 483
Lys Val Val Lys Pro Phe Gly Asp Ser Arg Met Tyr Ser Ile Thr Phe
145 150 155

acg aag gtt tta tct gtt tgt gtt tgg gtg atc atg gct gtt ttg tct 531
Thr Lys Val Leu Ser Val Cys Val Trp Val Ile Met Ala Val Leu Ser
160 165 170

ttg cca aac atc atc ctg aca aat ggt cag cca aca gag gac aat atc 579
Leu Pro Asn Ile Ile Leu Thr Asn Gly Gln Pro Thr Glu Asp Asn Ile
175 180 185

cat gac tgc tca aaa ctt aaa agt cct ttg ggg gtc aaa tgg cat acg 627
His Asp Cys Ser Lys Leu Lys Ser Pro Leu Gly Val Lys Trp His Thr
190 195 200 205

gca gtc acc tat gtg aac agc tgc ttg ttt gtg gcc gtg ctg gtg att 675
Ala Val Thr Tyr Val Asn Ser Cys Leu Phe Val Ala Val Leu Val Ile
210 215 220

ctg atc gga tgt tac ata gcc ata tcc agg tac atc cac aaa tcc agc 723
Leu Ile Gly Cys Tyr Ile Ala Ile Ser Arg Tyr Ile His Lys Ser Ser
225 230 235

agg caa ttc ata agt cag tca agc cga aag cga aaa cat aac cag agc 771
Arg Gln Phe Ile Ser Gln Ser Ser Arg Lys Arg Lys His Asn Gln Ser
240 245 250

atc agg gtt gtt gtg gct gtg ttt ttt acc tgc ttt cta cca tat cac 819
Ile Arg Val Val Val Ala Val Phe Phe Thr Cys Phe Leu Pro Tyr His
255 260 265

ttg tgc aga att cct ttt act ttt agt cac tta gac agg ctt tta gat 867
Leu Cys Arg Ile Pro Phe Thr Phe Ser His Leu Asp Arg Leu Leu Asp
270 275 280 285

gaa tct gca caa aaa atc cta tat tac tgc aaa gaa att aca ctt ttc 915
Glu Ser Ala Gln Lys Ile Leu Tyr Tyr Cys Lys Glu Ile Thr Leu Phe
290 295 300

ttg tct gcg tgt aat gtt tgc ctg gat cca ata att tac ttt ttc atg 963
Leu Ser Ala Cys Asn Val Cys Leu Asp Pro Ile Ile Tyr Phe Phe Met
305 310 315

tgt agg tca ttt tca aga agg ctg ttc aaa aaa tca aat atc aga acc 1011
Cys Arg Ser Phe Ser Arg Arg Leu Phe Lys Lys Ser Asn Ile Arg Thr
320 325 330

agg agt gaa agc atc aga tca ctg caa agt gtg aga aga tcg gaa gtt 1059
Arg Ser Glu Ser Ile Arg Ser Leu Gln Ser Val Arg Arg Ser Glu Val
335 340 345

ctc ata tat tat gat tat act gat gtg tag gccttttatt gtttgttgga 1109
Leu Ile Tyr Tyr Asp Tyr Thr Asp Val
350 355

atcgatatgt acaaagtgta aataaatgtt tcttttcatt aaaaaaaaaa aaaaa 1164

<210> 18 
<211> 358 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Met Gly Phe Asn Leu Thr Leu Ala Lys Leu Pro Asn Asn Glu Leu His
1 5 10 15

Gly Gln Glu Ser His Asn Ser Gly Asn Arg Ser Asp Gly Pro Gly Lys
20 25 30

Asn Thr Thr Leu His Asn Glu Phe Asp Thr Ile Val Leu Pro Val Leu
35 40 45

Tyr Leu Ile Ile Phe Val Ala Ser Ile Leu Leu Asn Gly Leu Ala Val
50 55 60

Trp Ile Phe Phe His Ile Arg Asn Lys Thr Ser Phe Ile Phe Tyr Leu
65 70 75 80

Lys Asn Ile Val Val Ala Asp Leu Ile Met Thr Leu Thr Phe Pro Phe
85 90 95

Arg Ile Val His Asp Ala Gly Phe Gly Pro Trp Tyr Phe Lys Phe Ile
100 105 110

Leu Cys Arg Tyr Thr Ser Val Leu Phe Tyr Ala Asn Met Tyr Thr Ser
115 120 125

Ile Val Phe Leu Gly Leu Ile Ser Ile Asp Arg Tyr Leu Lys Val Val
130 135 140

Lys Pro Phe Gly Asp Ser Arg Met Tyr Ser Ile Thr Phe Thr Lys Val
145 150 155 160

Leu Ser Val Cys Val Trp Val Ile Met Ala Val Leu Ser Leu Pro Asn
165 170 175

Ile Ile Leu Thr Asn Gly Gln Pro Thr Glu Asp Asn Ile His Asp Cys
180 185 190

Ser Lys Leu Lys Ser Pro Leu Gly Val Lys Trp His Thr Ala Val Thr
195 200 205

Tyr Val Asn Ser Cys Leu Phe Val Ala Val Leu Val Ile Leu Ile Gly
210 215 220

Cys Tyr Ile Ala Ile Ser Arg Tyr Ile His Lys Ser Ser Arg Gln Phe
225 230 235 240

Ile Ser Gln Ser Ser Arg Lys Arg Lys His Asn Gln Ser Ile Arg Val
245 250 255

Val Val Ala Val Phe Phe Thr Cys Phe Leu Pro Tyr His Leu Cys Arg
260 265 270

Ile Pro Phe Thr Phe Ser His Leu Asp Arg Leu Leu Asp Glu Ser Ala
275 280 285

Gln Lys Ile Leu Tyr Tyr Cys Lys Glu Ile Thr Leu Phe Leu Ser Ala
290 295 300

Cys Asn Val Cys Leu Asp Pro Ile Ile Tyr Phe Phe Met Cys Arg Ser
305 310 315 320

Phe Ser Arg Arg Leu Phe Lys Lys Ser Asn Ile Arg Thr Arg Ser Glu
325 330 335

Ser Ile Arg Ser Leu Gln Ser Val Arg Arg Ser Glu Val Leu Ile Tyr
340 345 350

Tyr Asp Tyr Thr Asp Val
355

<210> 19 
<211> 2480 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (42)..(1157)
<400> 19 

catggcatcc ccagcctagc tcccaatccc actttggcac g atg tta gcc aac agc 56
Met Leu Ala Asn Ser
1 5

tcc tca acc aac agt tct gtt ctc ccg tgt cct gac tac cga cct acc 104
Ser Ser Thr Asn Ser Ser Val Leu Pro Cys Pro Asp Tyr Arg Pro Thr
10 15 20

cac cgc ctg cac ttg gtg gtc tac agc ttg gtg ctg gct gcc ggg ctc 152
His Arg Leu His Leu Val Val Tyr Ser Leu Val Leu Ala Ala Gly Leu
25 30 35

ccc ctc aac gcg cta gcc ctc tgg gtc ttc ctg cgc gcg ctg cgc gtg 200
Pro Leu Asn Ala Leu Ala Leu Trp Val Phe Leu Arg Ala Leu Arg Val
40 45 50

cac tcg gtg gtg agc gtg tac atg tgt aac ctg gcg gcc agc gac ctg 248
His Ser Val Val Ser Val Tyr Met Cys Asn Leu Ala Ala Ser Asp Leu
55 60 65

ctc ttc acc ctc tcg ctg ccc gtt cgt ctc tcc tac tac gca ctg cac 296
Leu Phe Thr Leu Ser Leu Pro Val Arg Leu Ser Tyr Tyr Ala Leu His
70 75 80 85

cac tgg ccc ttc ccc gac ctc ctg tgc cag acg acg ggc gcc atc ttc 344
His Trp Pro Phe Pro Asp Leu Leu Cys Gln Thr Thr Gly Ala Ile Phe
90 95 100

cag atg aac atg tac ggc agc tgc atc ttc ctg atg ctc atc aac gtg 392
Gln Met Asn Met Tyr Gly Ser Cys Ile Phe Leu Met Leu Ile Asn Val
105 110 115

gac cgc tac gcc gcc atc gtg cac ccg ctg cga ctg cgc cac ctg cgg 440
Asp Arg Tyr Ala Ala Ile Val His Pro Leu Arg Leu Arg His Leu Arg
120 125 130

cgg ccc cgc gtg gcg cgg ctg ctc tgc ctg ggc gtg tgg gcg ctc atc 488
Arg Pro Arg Val Ala Arg Leu Leu Cys Leu Gly Val Trp Ala Leu Ile
135 140 145

ctg gtg ttt gcc gtg ccc gcc gcc cgc gtg cac agg ccc tcg cgt tgc 536
Leu Val Phe Ala Val Pro Ala Ala Arg Val His Arg Pro Ser Arg Cys
150 155 160 165

cgc tac cgg gac ctc gag gtg cgc cta tgc ttc gag agc ttc agc gac 584
Arg Tyr Arg Asp Leu Glu Val Arg Leu Cys Phe Glu Ser Phe Ser Asp
170 175 180

gag ctg tgg aaa ggc agg ctg ctg ccc ctc gtg ctg ctg gcc gag gcg 632
Glu Leu Trp Lys Gly Arg Leu Leu Pro Leu Val Leu Leu Ala Glu Ala
185 190 195

ctg ggc ttc ctg ctg ccc ctg gcg gcg gtg gtc tac tcg tcg ggc cga 680
Leu Gly Phe Leu Leu Pro Leu Ala Ala Val Val Tyr Ser Ser Gly Arg
200 205 210

gtc ttc tgg acg ctg gcg cgc ccc gac gcc acg cag agc cag cgg cgg 728
Val Phe Trp Thr Leu Ala Arg Pro Asp Ala Thr Gln Ser Gln Arg Arg
215 220 225

cgg aag acc gtg cgc ctc ctg ctg gct aac ctc gtc atc ttc ctg ctg 776
Arg Lys Thr Val Arg Leu Leu Leu Ala Asn Leu Val Ile Phe Leu Leu
230 235 240 245

tgc ttc gtg ccc tac aac agc acg ctg gcg gtc tac ggg ctg ctg cgg 824
Cys Phe Val Pro Tyr Asn Ser Thr Leu Ala Val Tyr Gly Leu Leu Arg
250 255 260

agc aag ctg gtg gcg gcc agc gtg cct gcc cgc gat cgc gtg cgc ggg 872
Ser Lys Leu Val Ala Ala Ser Val Pro Ala Arg Asp Arg Val Arg Gly
265 270 275

gtg ctg atg gtg atg gtg ctg ctg gcc ggc gcc aac tgc gtg ctg gac 920
Val Leu Met Val Met Val Leu Leu Ala Gly Ala Asn Cys Val Leu Asp
280 285 290

ccg ctg gtg tac tac ttt agc gcc gag ggc ttc cgc aac acc ctg cgc 968
Pro Leu Val Tyr Tyr Phe Ser Ala Glu Gly Phe Arg Asn Thr Leu Arg
295 300 305

ggc ctg ggc act ccg cac cgg gcc agg acc tcg gcc acc aac ggg acg 1016
Gly Leu Gly Thr Pro His Arg Ala Arg Thr Ser Ala Thr Asn Gly Thr
310 315 320 325

cgg gcg gcg ctc gcg caa tcc gaa agg tcc gcc gtc acc acc gac gcc 1064
Arg Ala Ala Leu Ala Gln Ser Glu Arg Ser Ala Val Thr Thr Asp Ala
330 335 340

acc agg ccg gat gcc gcc agt cag ggg ctg ctc cga ccc tcc gac tcc 1112
Thr Arg Pro Asp Ala Ala Ser Gln Gly Leu Leu Arg Pro Ser Asp Ser
345 350 355

cac tct ctg tct tcc ttc aca cag tgt ccc cag gat tcc gcc ctc 1157
His Ser Leu Ser Ser Phe Thr Gln Cys Pro Gln Asp Ser Ala Leu
360 365 370

tgaacacaca tgccattgcg ctgtccgtgc ccgactccca acgcctctcg ttctgggagg 1217

cttacagggt gtacacacaa gaaggtgggc tgggcacttg gacctttggg tggcaattcc 1277

agcttagcaa cgcagaagag tacaaagtgt ggaagccagg gcccagggaa ggcagtgctg 1337

ctggaaatgg cttctttaaa ctgtgagcac gcagagcacc ccttctccag cggtgggaag 1397 

tgatgcagag agcccacccg tgcagagggc agaagaggac gaaatgcctt tgggtgggca 1457 

gggcattaaa ctgctaaaag ctggttagat ggaacagaaa atgggcattc tggatctaaa 1517 

ccgccacagg ggcctgagag ctgaagagca ccaggtttgg tggacaaagc tactgagatg 1577 

cctgttcatc tgctgacttc tgtctaggct catggatgcc accccctttc atttcggcct 1637 

aggcttcccc tgctcaccac tgaggcctaa tacaagagtt cctatggaca gaactacatt 1697 

ctttctcgca tagtgacttg tgacaattta gacttggcat ccagcatggg atagttgggg 1757 

caaggcaaaa ctaacttaga gtttccccct caacaacatc caagtccaaa ccctttttag 1817

gttatccttt cttccatcac atcccctttt ccaggcctcc tccattttag gtccttaata 1877 

ttctttcttt ttctctctct ctcgtttctc tcttctctct cctctcctct cctctctctt 1937 

ctcctcttct ctctctctcc ctctctctcc tttgtccaga gtaaggataa aattctttct 1997 

actaaagcac tggttctcaa actttttggt ctcagacccc actcttagaa attgaggatc 2057 

tcaaagagct ttgcttatat tttgttcttt tgatacttac catactagaa attaaagcga 2117 

atacattttt aaaataaata cacatgcaca cattacatta gccatgggag caataatgtc 2177 

accacacaca cttcatgaag cctctggaaa actctacagt atacttgtga gagaatgaga 2237 

gtgaaaggga caaataacat ctgtgtagca gtattatgaa aatagcttga ccttgtggac 2297 

ttcctcagag ggttggtccc tggatcacac tttgagaacc atacttgtcc tgaagtattg 2357 

gagttcatgt ctaacttctt cccagggcat tatgtacagt gctttttatt actgtgggga 2417 

gagggcagtg ctaaataaat taatcactac tgataaaaaa aaaaaaaaaa aaaaaaaaaa 2477

aaa 2480 

<210> 20 

<211> 372 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Leu Ala Asn Ser Ser Ser Thr Asn Ser Ser Val Leu Pro Cys Pro
1 5 10 15

Asp Tyr Arg Pro Thr His Arg Leu His Leu Val Val Tyr Ser Leu Val
20 25 30

Leu Ala Ala Gly Leu Pro Leu Asn Ala Leu Ala Leu Trp Val Phe Leu
35 40 45

Arg Ala Leu Arg Val His Ser Val Val Ser Val Tyr Met Cys Asn Leu
50 55 60

Ala Ala Ser Asp Leu Leu Phe Thr Leu Ser Leu Pro Val Arg Leu Ser
65 70 75 80

Tyr Tyr Ala Leu His His Trp Pro Phe Pro Asp Leu Leu Cys Gln Thr
85 90 95

Thr Gly Ala Ile Phe Gln Met Asn Met Tyr Gly Ser Cys Ile Phe Leu
100 105 110

Met Leu Ile Asn Val Asp Arg Tyr Ala Ala Ile Val His Pro Leu Arg
115 120 125

Leu Arg His Leu Arg Arg Pro Arg Val Ala Arg Leu Leu Cys Leu Gly
130 135 140

Val Trp Ala Leu Ile Leu Val Phe Ala Val Pro Ala Ala Arg Val His
145 150 155 160

Arg Pro Ser Arg Cys Arg Tyr Arg Asp Leu Glu Val Arg Leu Cys Phe
165 170 175

Glu Ser Phe Ser Asp Glu Leu Trp Lys Gly Arg Leu Leu Pro Leu Val
180 185 190

Leu Leu Ala Glu Ala Leu Gly Phe Leu Leu Pro Leu Ala Ala Val Val
195 200 205

Tyr Ser Ser Gly Arg Val Phe Trp Thr Leu Ala Arg Pro Asp Ala Thr
210 215 220

Gln Ser Gln Arg Arg Arg Lys Thr Val Arg Leu Leu Leu Ala Asn Leu
225 230 235 240

Val Ile Phe Leu Leu Cys Phe Val Pro Tyr Asn Ser Thr Leu Ala Val
245 250 255

Tyr Gly Leu Leu Arg Ser Lys Leu Val Ala Ala Ser Val Pro Ala Arg
260 265 270

Asp Arg Val Arg Gly Val Leu Met Val Met Val Leu Leu Ala Gly Ala
275 280 285

Asn Cys Val Leu Asp Pro Leu Val Tyr Tyr Phe Ser Ala Glu Gly Phe
290 295 300

Arg Asn Thr Leu Arg Gly Leu Gly Thr Pro His Arg Ala Arg Thr Ser
305 310 315 320

Ala Thr Asn Gly Thr Arg Ala Ala Leu Ala Gln Ser Glu Arg Ser Ala
325 330 335

Val Thr Thr Asp Ala Thr Arg Pro Asp Ala Ala Ser Gln Gly Leu Leu
340 345 350

Arg Pro Ser Asp Ser His Ser Leu Ser Ser Phe Thr Gln Cys Pro Gln
355 360 365

Asp Ser Ala Leu
370 
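
The CDS ranges in the <222> fields and the protein lengths in the <211> fields of SEQ ID NOS: 11 to 20 can be checked against one another by simple arithmetic. The sketch below is illustrative only; the names `RECORDS`, `codon_count` and `compare` are hypothetical and chosen for this example. It assumes that a CDS range may or may not include the terminating stop codon, since the range given for SEQ ID NO: 17 appears to include the final tag codon while the others shown here do not.

```python
# (DNA SEQ ID NO, CDS start, CDS end, protein SEQ ID NO, protein length),
# copied from the <222> and <211> fields of the listing.
RECORDS = [
    (11, 1, 921, 12, 307),
    (13, 266, 1375, 14, 370),
    (15, 1, 1188, 16, 396),
    (17, 13, 1089, 18, 358),
    (19, 42, 1157, 20, 372),
]


def codon_count(start, end):
    """Number of codons in an inclusive, 1-based CDS range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


def compare(start, end, protein_len):
    """Classify how a CDS range relates to the stated protein length."""
    codons = codon_count(start, end)
    if codons == protein_len:
        return "exact"
    if codons == protein_len + 1:
        return "includes stop codon"  # assumption: extra codon is the stop
    return "mismatch"


for dna_id, start, end, prot_id, prot_len in RECORDS:
    status = compare(start, end, prot_len)
    print(f"SEQ ID NO: {dna_id} -> SEQ ID NO: {prot_id}: {status}")
```

A "mismatch" result would point to a transcription error in either the range or the length field and would be worth checking against the original filing.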



<210> 21 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220>
<223> Description of Artificial Sequence: Primer LW1282
<400> 21 

taatacctgc actgcccac 

<210> 22 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1283
<400> 22 

tctttccttc tcttctcact cc 



<210> 23 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1373
<400> 23 

gcataagctt atgctaacac tgaataaaac ag 

<210> 24 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1374 
<400> 24 

gcatctcgag tcacatgctg taggatttgg 

<210> 25 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 25 

Ala Pro Arg Thr Pro Gly Gly Arg Arg 
1 5 



<210> 26 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1248
<400> 26

gcatgaattc caatatactt ccccatacct ac 

<210> 27 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1249 
<400> 27 

gcatggatcc ggaaaagaag gagaagaaag 

<210> 28 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1278 
<400> 28 

accgctgcct ttttagtc 



<210> 29 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1279 

<400> 29 

ccttctttct gggtacataa gtc 



<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1405 
<400> 30 

aagcataaca tggatgaaac aggaaatctg 

<210> 31 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1406 
<400> 31 

aagcataact atactttaca tatttcttc 
<210> 32 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1280 

<400> 32 

tctgcacaca gctcttccat gg 22 



<210> 33 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1281 
<400> 33 

tcccttgtcc agttggttga gg 22 



<210> 34 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1385 
<400> 34 

gcataagctt ccatggaact tcataacctg 30 



<210> 35 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1386 
<400> 35 

gcatctcgag ttacccccac agcgctgcag 30 



<210> 36 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1329 
<400> 36 

gcatctcgag tcagcctaag gttatgttg 29 



<210> 37 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1377 
<400> 37 

gcataagctt atgaacacca cagtgatgc 29 



<210> 38 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1387 
<400> 38 

gagaaatatt tttctaaaaa aacctgtttt tgcaaaaacg g 41 

<210> 39 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1388 
<400> 39 

ccgtttttgc aaaaacaggt ttttttagaa aaatatttct c 41 

<210> 40 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1314 
<400> 40 

gcatgaattc ccaccttcat catctacctc 30 



<210> 41 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1315 
<400> 41 

gcatggatcc gaagaccaaa aagacccag 29 



<210> 42 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1326 
<400> 42 

gcatgaatcc acgatggtgg atcccaatgg 30 



<210> 43 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1327 
<400> 43 

gcatctcgag cctagggctc tgaagcg 27 



<210> 44 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1415 
<400> 44 

ccatgtatat atttctttgc atgctttcag gcattgacat cc 42 



<210> 45 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1416 
<400> 45 

ggatgtcaat gcctgaaagc atgcaaagaa atatatacat gg 42 



<210> 46 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1308 
<400> 46 

gcatgaattc actcacttct catctccttc 30 



<210> 47 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1309 



<400> 47 
gcatggatcc aatctccttt gtcttcactc 



<210> 48 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1324 
<400> 48 

gatcggatcc atggaaagcg agaacag 

<210> 49 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1325 
<400> 49 

gatcctcgag tcaggctatg tgcttattaa acacc 



<210> 50 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1306 
<400> 50 

gcatgaattc ttctacttca tcatcctcc 



<210> 51 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1307 
<400> 51 

gcatggatcc aaaggccatc acaacaag 



<210> 52 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer GV599 
<400> 52 

ggcagaagaa ggctattggt cttagacgag 
<210> 53 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer GV600 
<400> 53 

ctgaaacagc gcctcagctc cc 

<210> 54 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1482 
<400> 54 

agctatggcg aactatagcc atgcagc 



<210> 55 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW148 
<400> 55 

agtcctcata taacacagta aggttcc 



<210> 56 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1310 
<400> 56 

gcatgaattc gcagaagaag gctattgg 



<210> 57 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1311 
<400> 57 

gcatggatcc gcagtaaaga agggttgtg 



<210> 58 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer LW1442 



<400> 58 

gccattctgt ccacagaag 19 



<210> 59 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1443 
<400> 59 

tcagttgctg ttatggcac 19 



<210> 60 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1440 
<400> 60 

aagcggatgt ttagacctct tgtg 24 



<210> 61 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1441 
<400> 61 

aacagtcatg aataggaatt gag 23 



<210> 62 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1472 



<400> 62 

gcatgaattc tgccatgtca atcatttctc tc 32 



<210> 63 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1473 
<400> 63 

gcatggatcc gttctgcatt ttccaggtct c 



<210> 64 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1411 
<400> 64 

gcatgaattc tgccaaacat catcctgac 

<210> 65 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1412 
<400> 65 

gcatggatcc tacacagcca caacaaccc 

<210> 66 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1448 
<400> 66 

aagcggtacc atgttagcca acagctcctc 

<210> 67 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1449 
<400> 67 

aagctctaga tcagagggcg gaatcctgg 



<210> 68 
<211> 43 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 217A 
<400> 68 

taggtcggta gtcaggacac gggagaacag aactgttggt tga 43 

<210> 69 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 217B 
<400> 69 

gcccctgtgg cggtttagat ccagaatgcc cattttctgt tccatctaac ca 52 

<210> 70 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1480 
<400> 70 

ggttctacct ggacttatgg 20 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer LW1481 
<400> 71 

taatgaatga gtaagtgccc 20 

<210> 72 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer CON103a 
<400> 72 

tttattaata ttggaaggga caaactggag agcacagaac at 42 

<210> 73 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer CON103b 
<400> 73 



